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INTRODUCTION: 


Background: 

Snail  and  Slug  are  master  regulators  of  the  epithelial-mesenchymal  transition  (EMT)  differentiation 
pathway  involved  in  neural  crest  development  in  embryogenesis,  which  is  also  critically  involved  in 
the  invasiveness  and  lethality  of  cancer  metastasis.  An  absolute  pre-requisite  for  undergoing  EMT  is 
the  down-regulation  of  E-cadherin,  claudins  and  other  epithelial  cell  adhesion  molecules,  which 
facilitate  cell  motility.  The  Snail  subfamily  (Snail,  Snai2,  Snai3)  are  activated  during  early  EMT  and 
bind  directly  to  target  promoters  containing  the  canonical  E-box  recognition  sequence:  CACGTG. 

The  Snail  protein  contains  four  COOH-terminal  tandem  C2-H2  ZF  motifs,  possessing  sequence- 
specific  DNA  binding  activity  and  anchoring  a  protein  complex  to  chromatin  at  specific  target  genes. 
One  of  the  most  commonly  derived  sequences  recognized  by  the  Snail  transcription  factor  family  is 
the  CAGGTG  E-box  sequence,  usually  found  in  multiple  copies  in  promoters  regulated  by  Snail. 
Induction  of  EMT  by  Snail  during  tumor  progression  is  virtually  a  universal  property  of  late  stage 
carcinoma  progression  and  metastasis  as  the  cells  become  motile,  invasive  and  highly  metastatic. 
Hence,  targeting  the  functions  of  Snail  superfamily  members  with  small  molecule  therapeutics  is 
highly  relevant.  In  order  to  understand  and  target  the  Snail  family  of  factors,  we  must  define  the 
complete  set  of  target  genes  that  are  bound  by  Snail  in  the  pro-metastatic  cell  undergoing  EMT. 
Moreover,  we  must  define  the  post-translational  modifications  that  occur  on  Snail  and  the  Proteins  to 
which  it  binds  during  EMT.  We  hypothesized  that  the  PAPSS1  and  PAPSS2  are  direct  target  genes 
for  Snail.  As  described  below,  we  have  made  the  most  progress  on  defining  Snail  binding  proteins, 
modifications  and  effector  pathways  in  EMT. 

Hypothesis/Objective: 

The  original  hypothesis  was  that  the  Snail  protein  induces  enzymes  which  result  in  changes  in  cellular 
sulfonation  and  that  this  is  a  required  part  of  EMT  during  metastasis  in  breast  cancer. 

Specific  Aims: 

The  original  Specific  Aims  were:  1)  test  the  hypothesis  that  the  PAPSS2  gene  is  a  direct  target  of 
Snail  and  that  it  is  required  for  maintaining  the  mesenchymal  phenotype  and  invasive  property  of  the 
breast  cancer  cells  during  EMT  and  2)  test  the  hypothesis  that  Snail-stimulated  sulfonation  of 
proteoglycans  (PGs)  are  crucial  for  the  induction  of  EMT  and  metastasis. 

BODY: 

Overview:  In  the  course  of  the  3-year  project,  we  considerably  broadened  our  work  to  include  other 
aspects  of  Snail  function  including  defining  DNA  binding  specificity,  target  gene  interaction,  post- 
translational  modifications,  and  protein-protein  interactions.  This  broad  approach  has  been  very 
successful  and  has  led  to  many  novel  discoveries.  This  broadening  of  the  project  was  approved  by 
the  DOD  and  was  described  in  our  annual  reports,  along  with  the  experimental  and  technical 
difficulties  in  addressing  the  original  hypothesis.  Both  the  successes  and  the  problems  are  detailed 
below. 

A  method  for  in  silico  identification  of  SNAIL/SLUG  DNA  binding  potentials  to  the  E-box 
sequence  using  molecular  dynamics  and  evolutionary  conserved  amino  acids. 

Binding  of  Snail  to  DNA  is  a  dynamic  process  allowing  for  spatial-  and  sequence-specificity  of  the  E 
box.  Many  methods  for  determination  of  DNA-protein  structures  do  not  allow  for  identification  of 
dynamics  of  the  search  process,  but  provide  only  a  single  snapshot  of  the  most  stable  binding.  In 
order  to  better  understand  the  dynamics  of  DNA  binding  as  a  protein  encounters  its  cognate  site,  we 
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have  created  a  computer-based  DNA  scanning  array  macro  that  sequentially  inserts  a  high  affinity 
DNA  consensus  binding  site  at  all  possible  locations  in  a  predicted  protein-DNA  interface.  We 
showed,  using  short  molecular  dynamic  simulations  at  each  location  in  the  interface,  that  energy 
minimized  states  and  decreased  movement  of  evolutionary  conserved  amino  acids  can  be  readily 
observed  and  used  to  predict  the  consensus  binding  site.  The  macro  was  applied  to  SNAIL  class 
C2H2  zinc  finger  family  proteins.  The  analysis  suggests  that  (1)  SNAIL  binds  to  the  E-box  in  multiple 
states  during  its  encounter  with  its  cognate  site;  (2)  several  different  amino  acids  contribute  to  the  E- 
box  binding  in  each  state;  (3)  the  linear  array  of  zinc  fingers  contributes  differentially  to  overall  folding 
and  base-pair  recognition;  and  (4)  each  finger  may  be  specialized  for  stability  and  sequence 
specificity.  Moreover,  the  macromolecular  movement  observed  using  this  dynamic  approach  may 
allow  the  NH2-terminal  finger  to  bind  without  sequence  specificity  yet  result  in  higher  binding  energy. 
This  macro  and  overall  approach  could  be  applicable  to  many  evolutionary  conserved  transcription 
factor  families  and  should  help  to  better  elucidate  the  varied  mechanisms  used  for  DNA  sequence- 
specific  binding.  Moreover,  this  analysis  should  allow  us  to  better  distinguish  real,  direct  targets  of 
Snail  and  Slug  during  EMT  than  previous  resolution  has  allowed. 

Prokop  JW,  Liu  Y,  Milsted  A,  Peng  H,  Rauscher  FJ  3rd.  A  method  for  in  silico  identification  of 
SNAIL/SLUG  DNA  binding  potentials  to  the  E-box  sequence  using  molecular  dynamics  and 
evolutionary  conserved  amino  acids.  J  Mol  Model.  2013  Sep;19(9):3463-9.  doi:  10.1007/s00894-013- 
1876-y.  Epub  2013  May  25.  PMCID:  PMC3745821. 

High  resolution  DNA  recognition  by  the  Snail  zinc  finger  protein:  Testing  of  a  Molecular 
Dynamics  based  model  defines  the  atomic  level  interactions  required  for  high  affinity  binding, 
E-box  specificity  and  reveals  potential  strategies  for  small  molecule  control  of  EMT 
transcriptional  programs 

To  verify  the  molecular  models  for  Snail-DNA  binding  we  performed  a  saturation  site-directed 
mutagenesis  study.  These  experiments  yielded  the  following  conclusions:  Multiple  AA-Base  contacts 
occur  on  both  strands  of  DNA;  All  4  ZF  might  not  contribute  equally  to  DNA  affinity  &  specificity;  High 
affinity  docking  of  the  model  does  not  require  large  change  in  DNA  structure:  B-form  DNA  is 
maintained  during  Snail  DNA  binding.  Moreover,  canonical  AA-base  contacts  seen  in  other  ZF 
structures,  but  new  modes  of  content  are  evident.  AA-Base  contacts  were  verified  and  revealed 
specifically  thru  mutagenesis.  TheW193  in  finger  2,  and  the  S221  in  finger  3  and  the  R247  in  the 
finger  4  determine  the  sequence  specificity  for  Snail  DNA-binding,  while  R191  in  finger  2  and  N222  in 
finger  3  stabilize  the  complex  via  phosphate  backbone  contacts.  Snail  has  high  specificity  for  DNA 
recognition  ,  which  requires  the  second  central  G  to  be  presented  in  the  E-box,  while  the  central  Gs 
are  not  required  for  Twist  bHLH  DNA-binding;  E-box  methylation  has  less  influence  for  Snail  DNA- 
binding  as  compared  to  bHLH  DNA-binding.  These  results  provide  the  first  comparative  analysis  of 
Twist  and  Snail  binding  the  canonical  E-box. 

In  preparation  for  submission. 

Snail  Recruits  RingIB  to  Mediate  Transcriptional  Repression  and  Cell  Migration  in  Pancreatic 
Cancer  Cells. 

Transcriptional  repressor  Snail  is  a  master  regulator  of  epithelial-mesenchymal  transition  (EMT),  yet 
the  epigenetic  mechanism  governing  Snail  to  induce  EMT  is  not  well  understood.  We  discovered 
elevated  levels  of  the  ubiquitin  E3  ligase  RingIB  and  Snail,  along  with  elevated  monoubiquitination  of 
H2A  at  K1 1 9  (H2AK1 1 9Ub1 ),  are  highly  correlated  with  poor  survival.  Mechanistic  investigations 
identified  Ringl  B  as  a  Snail-interacting  protein  and  showed  that  the  carboxyl  zinc  fingers  of  Snail 
recruit  RingIB  and  its  paralog  Ringl  A  to  repress  its  target  promoters.  Simultaneous  depletion  of 
Ringl  A  and  RingIB  in  cancer  cells  decreased  Snail  binding  to  the  target  chromatin,  abolished 
H2AK119Ub1  modification,  and  thereby  compromised  Snail-mediated  transcriptional  repression  and 
cell  migration.  We  found  that  RingIB  and  the  SNAG-associated  chromatin  modifier  EZH2  formed 
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distinct  protein  complexes  with  Snail  and  that  EZH2  was  required  for  Snail-RinglA/B  recruitment  to 
the  target  promoter.  Collectively,  our  results  unravel  an  epigenetic  mechanism  underlying 
transcriptional  repression  by  Snail,  suggest  Ringl  A/B  as  a  candidate  therapeutic  target,  and  identify 
H2AK119Ub1  as  a  potential  biomarker  for  diagnosis  and  prognosis. 

Chen  J,  Xu  H,  Zou  X,  Wang  J,  Zhu  Y,  Chen  H,  Shen  B,  Deng  X,  Zhou  A,  Chin  YE,  Rauscher  FJ  3rd, 
Peng  C,  Hou  Z.  Snail  Recruits  Ringl  B  to  Mediate  Transcriptional  Repression  and  Cell  Migration  in 
Pancreatic  Cancer  Cells.  Cancer  Res.  2014  Aug  15;74(16):4353-63.  doi:  10.1 158/0008-5472. CAN- 
14-0181  .  Epub  2014  Jun  5.  PMID:  24903147. 

LIMD2  is  a  small  LIM-only  protein  overexpressed  in  metastatic  lesions  which  regulates  cell 
motility  and  tumor  progression  by  directly  binding  to  and  activating  the  integrin-linked  kinase. 

Proteins  that  communicate  signals  from  the  cytoskeleton  to  the  nucleus  are  prime  targets  for  effectors 
of  metastasis  as  they  often  transduce  signals  regulating  adhesion,  motility,  and  invasiveness.  LIM 
domain  proteins  shuttle  between  the  cytoplasm  and  the  nucleus  and  bind  to  partners  in  both 
compartments,  often  coupling  changes  in  gene  expression  to  extracellular  cues.  As  part  of  our  DOD 
funded  work  on  Snail,  we  looked  at  potential  effectors  of  EMT.  LIMD2  is  a  mechanistically  undefined 
LIM-only  protein  originally  found  to  be  overexpressed  in  metastatic  lesions  but  absent  in  the  matched 
primary  tumor.  LIMD2  levels  in  fresh  and  archival  tumors  positively  correlate  with  cell  motility, 
metastatic  potential,  and  grade  including  bladder,  melanoma,  breast  and  thyroid  tumors.  LIMD2 
directly  contributes  to  these  cellular  phenotypes  as  shown  by  overexpression,  knockdown  and 
reconstitution  experiments  in  cell  culture  models.  The  solution  structure  of  LIMD2  determined  using 
NMR  revealed  a  classic  LIM-domain  structure  that  was  highly  related  to  LIM1  of  PINCH1,  a  core 
component  of  the  Integrin  Linked  Kinase-Parvin-Pinch  (IPP)  complex.  Structural  and  biochemical 
analyses  revealed  that  LIMD2  bound  directly  to  the  kinase  domain  of  ILK  near  the  active  site  and 
strongly  activated  ILK  kinase  activity.  Cells  that  were  null  for  ILK  failed  to  respond  to  the  induction  of 
invasion  by  LIMD2.  This  strongly  suggests  that  LIMD2  potentiates  its  biological  effects  through  direct 
interactions  with  ILK,  a  signal  transduction  pathway  strongly  linked  to  cell  motility  and  invasion.  In 
summary,  we  discovered  that  LIMD2  is  a  new  component  of  the  signal  transduction  cascade  that 
links  integrin-mediated  signaling  to  cell  motility/metastatic  behavior  and  may  be  a  promising  target  for 
controlling  tumor  spread.  Moreover,  recent  work  suggests  that  the  LIMD2-ILK  axis  is  directly  involved 
in  the  Snail  EMT  pathway. 

Peng  H,  Talebzadeh-Farrooji  M,  Osborne  MJ,  Prokop  JW,  McDonald  PC,  Karar  J,  Hou  Z,  He  M, 
Kebebew  E,  Orntoft  T,  Herlyn  M,  Caton  AJ,  Fredericks  W,  Malkowicz  B,  Paterno  CS,  Carolin  AS, 
Speicher  DW,  Skordalakes  E,  Huang  Q,  Dedhar  S,  Borden  KL,  Rauscher  FJ  3rd.  LIMD2  is  a  small 
LIM-only  protein  overexpressed  in  metastatic  lesions  that  regulates  cell  motility  and  tumor 
progression  by  directly  binding  to  and  activating  the  integrin-linked  kinase.  Cancer  Res.  2014  Mar 
1;74(5):1 390-1403.  PMID:  24590809. 

A  Novel  Phospho-Switch  in  the  Linker  Region  of  the  Snail  Zinc  Finger  Protein  which  regulates 
14-3-3  association,  DNA  binding  and  Epithelial-Mesenchymal  Differentiation. 

Regulation  of  transcription  factor  function  in  eukaryotes  often  occurs  at  the  level  of  DNA  binding  site 
recognition,  via  dimerization,  allosteric  regulation  of  the  DNA  binding  domain,  or  post-translational 
modification.  The  C2H2  family  of  zinc  finger  (ZF)  proteins  comprises  the  largest  family  of 
transcription  factors  (TFs)  existing  in  the  human  proteome;  their  ZFs  bind  to  DNA  as  monomers, 
however,  little  is  known  about  how  DNA  binding  is  regulated.  We  discovered  a  highly  conserved 
phospho-switch  in  the  ZF  domains  of  Snail  and  Slug,  a  subfamily  of  C2H2  ZFs  that  function  in  both 
normal  development  and  tumor  progression  to  metastasis  by  regulating  the  Epithelial-Mesenchymal 
Transition  (EMT)  program.  Using  highly  specific  phospho-Snail  antibodies,  we  showed  that  the 
phosphorylation  at  Seri  1  and  Thr177  sites  occurs  robustly  and  the  modified  Snail  proteins  are  bound 
to  Snail  target  genes  during  EMT.  Kinase  PRKG1  was  identified  to  modify  Snail  at  T177,  which  is 
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located  in  the  canonical  HC  linker  region  between  fingers  1  and  2.  Phosphorylation  at  T177  reduces 
Snail-DNA  interaction  while  recruiting  14-3-3  proteins.  Recognition  of  pTI  77-Snail  is  14-3-3  isoform- 
specific  and  is  modulated  by  the  specific  DNA  target  site  for  Snail.  A  molecular  dynamics  based 
model  of  the  pTI  77-Snail  /1 4-3-3  /DNA  complex  shows  dramatic  14-3-3-mediated  unfolding  of  finger 
1,  largely  supporting  the  experimental  observations.  In  conclusion,  we  discovered  a  novel 
phosphorylation  in  the  DNA  binding  region  of  Snail,  which  plays  a  critical  role  in  regulating  Snail- 
induced  EMT  through  dictating  the  states  of  Snail  in  protein  recruitment  and  protein-DNA  binding. 

Submitted  and  under  review:  J.  Mol.  Biol. 

An  evolutionarily  conserved  DNA  architecture  determines  target  specificity  of  the  Twist-family 
bHLH  transcription  factors 

Basic  helix-loop-helix  (bHLH)  transcription  factors  and  the  Snail  family  of  zinc  finger  proteins 
recognize  the  canonical  E-box  (CANNTG)  to  regulate  gene  transcription;  however,  given  the 
prevalence  of  E-boxes  in  a  genome,  it  has  been  puzzling  how  individual  bHLH  and  Snail  proteins 
selectively  recognize  E-box  sequences  on  their  targets.  Twist  is  a  bHLH  transcription  factor  that 
promotes  Epithelial-Mesenchymal  Transition  (EMT)  during  development  and  tumor  metastasis.  High 
resolution  mapping  of  Twist  occupancy  in  human  and  Drosophila  genomes  reveals  that  Twist,  but  not 
other  bHLH  proteins,  recognizes  a  unique  double  E-box  motif  with  two  E-boxes  spaced  preferentially 
by  five  nucleotides.  Using  molecular  modeling  and  binding  kinetic  analyses,  we  found  that  the  strict 
spatial  configuration  in  the  double  E-box  motif  aligns  two  Twist-E47  dimers  on  the  same  face  of  DNA, 
thus  providing  a  high  affinity  site  for  a  highly  stable  intramolecular  tetramer.  Biochemical  analyses 
showed  that  the  WR  domain  of  Twist  dimerizes  to  mediate  tetramer  formation,  which  is  functionally 
required  for  Twist-induced  EMT.  These  results  uncover  a  novel  mechanism  for  a  bHLH  transcription 
factor  to  recognize  a  unique  spatial  configuration  of  E-boxes  to  achieve  target  specificity.  Moreover, 
the  identification  of  this  double  E-box  allows  us  to  easily  distinguish  a  Snail  target  gene  from  a  Twist 
target  gene.  Whether  these  sets  of  genes  overlap  in  regulation  during  EMT  is  of  great  interest. 
Moreover,  the  WR-WR  domain  interaction  uncovered  here  sets  an  example  of  target  gene  specificity 
of  a  bHLH  protein  being  controlled  allosterically  by  a  domain  outside  of  the  bHLH  region  and 
represents  a  promising  target  for  small  molecule  therapeutics  directed  at  EMT  transcription  factors. 

Under  Revision:  Genes  &  Dev. 

As  described  above,  our  analyses  of  the  biochemical  properties  of  Snail  has  been  extremely 
successful.  However,  our  analysis  of  the  PAPSS1  and  2  genes  as  Snail  targets  has  been  fraught 
with  difficulties  over  the  entire  project  period.  This  has  been  noted  in  each  yearly  progress  report. 
Saying  it  simply:  Nothing  we  tried  worked.  These  2  genes  seem  to  be  highly  required  for  cell  viability 
and  any  change  to  their  status  via  knockdown  or  over  expression  leads  to  cell  death.  Specifically,  we 
could  not  overcome  the  following  problems: 

However,  we  have  had  unexpected  experimental  approach  difficulties  which  were  completely 
unforeseen  during  design  of  the  original  project.  Specifically,  a  cornerstone  experimental  procedure 
for  testing  our  hypothesis  that  PAPSS2  controls  the  mesenchymal  phenotype,  is  the  ability  to  stably 
eradicate  expression  of  PAPSS2  gene  expression  in  various  cell  lines  and  in  tumors.  The  state-of- 
the-art  technique  for  doing  this  is  miRNA  mediated  gene  knockdown.  However,  this  simply  does  not 
work  in  our  hands  for  this  particular  gene.  We  have  tried  multiple  independent  experimental 
approaches  to  miRNA  KO  including  transient  transfection,  (either  naked  RNA  or  synthetic  oligos), 
transient  vector  (lentiviral-mediated)  expression  of  miRNA  ,  and  finally  selection  of  independent  clonal 
lines  of  stable  expressing  miRNA  vectors.  The  results  show  that  contrary  to  any  gene  which  we  have 
ever  worked  with,  none  of  these  techniques  are  able  to  silence  PAPSS2  (or  PAPSS1 ).  The  cells 
either  immediately  shut  off  expression  of  the  transgene  miRNA  vector  or  the  cells  die.  Thus,  either 
PAPSS2  is  an  extremely  important  gene  for  normal  cell  homeostasis,  or  there  are  very  significant  off- 


7 


target  effects  in  our  constructs.  However,  the  latter  explanation  is  very  curious  as  we  have  used 
multiple  independent  targeting  sequences  for  PAPSS2,  which  should  control  for  these  effects.  In 
order  to  perform  the  comparative  genomic  and  proteomic  analyses  of  metastatic  cell  that  do  or  do  not 
have  PAAPSS2  expression  as  originally  proposed,  we  must  be  able  to  do  this.  This  must  await 
development  of  better  technologies  to  tackle  properly. 

Summary:  Despite  the  difficulties,  we  have  made  fundamental  new  discoveries  regarding  the  role  of 
Snail  and  other  E-box  binding  transcription  factors  in  EMT. 

KEY  RESEARCH  ACCOMPLISHMENTS:  1  Jul  201 1  -  30  Jun  2014 

This  project  seeks  to  establish  a  key  role  for  Snail  in  the  metastatic  progression  of  breast  cancer.  To 
this  end,  we  have  accomplished  the  following  tasks  over  the  entire  research  period: 

1 .  Identification  of  the  mode  of  DNA  binding  for  Snail. 

2.  Identification  of  PRKG1  as  the  key  kinase  that  modifies  Snail. 

3.  Demonstration  that  RINGIa/a,  14-3-3  and  PRKG1  directly  bind  to  Snail. 

4.  Demonstration  MCF-10A  cells  undergo  dramatic  morphologic  EMT  in  the  presence  of  WT 
Snail  expression,  with  the  downregulation  of  the  Cell  adhesion  molecule  E-cadherin  and  the 
upregulation  of  the  mesenchymal  markers  vimentin  and  fibronectin. 

5.  Identification  of  a  double  E-box  binding  site  for  Twist,  which  is  separate  and  distinct  from  that 
of  Snail. 

6.  Demonstration  that  cooperative  binding  to  the  double  E-box  is  mediated  by  the  WR  domain. 

7.  Discovery  that  LIMD2  induces  the  EMT  phenotype  and  may  be  an  effector  of  the  Snail  induced 
morphogenetic  pathway. 

REPORTABLE  OUTCOMES: 

•  Published  and  in  preparation,  in  revision,  in  review  manuscripts. 

Published  manuscripts: 

A  method  for  in  silico  identification  of  SNAIL/SLUG  DNA  binding  potentials  to  the  E-box  sequence 

using  molecular  dynamics  and  evolutionary  conserved  amino  acids. 

Prokop  JW,  Liu  Y,  Milsted  A,  Peng  H,  Rauscher  FJ  3rd. 

J  Mol  Model.  2013  Sep;19(9):3463-9.  doi:  10.1007/s00894-013-1876-y.  Epub  2013  May  25.  PMCID: 
PMC3745821 . 

LIMD2  is  a  small  LIM-only  protein  overexpressed  in  metastatic  lesions  that  regulates  cell  motility  and 

tumor  progression  by  directly  binding  to  and  activating  the  integrin-linked  kinase. 

Peng  H,  Talebzadeh-Farrooji  M,  Osborne  MJ,  Prokop  JW,  McDonald  PC,  Karar  J,  Hou  Z,  He  M, 
Kebebew  E,  Orntoft  T,  Herlyn  M,  Caton  AJ,  Fredericks  W,  Malkowicz  B,  Paterno  CS,  Carolin  AS, 
Speicher  DW,  Skordalakes  E,  Huang  Q,  Dedhar  S,  Borden  KL,  Rauscher  FJ  3rd. 

Cancer  Res.  2014  Mar  1  ;74(5):  1390-403.  doi:  10.1 158/0008-5472. CAN- 1 3-1 275.  PMID:  24590809. 

Snail  Recruits  RinqIB  to  Mediate  Transcriptional  Repression  and  Cell  Migration  in  Pancreatic  Cancer 

Cells. 

Chen  J,  Xu  H,  Zou  X,  Wang  J,  Zhu  Y,  Chen  H,  Shen  B,  Deng  X,  Zhou  A,  Chin  YE,  Rauscher  FJ  3rd, 
Peng  C,  Hou  Z. 

Cancer  Res.  2014  Aug  15;74(16):4353-63.  doi:  10.1 1 58/0008-5472.CAN-1 4-01 81 .  Epub  2014  Jun  5. 
PMID:  24903147. 
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CONCLUSION: 


In  summary,  we  have  been  highly  motivated  to  describe  completely  the  pathways  controlled  by  Snail 
through  activation  of  gene  expression  to  regulate  EMT  and  metastasis  in  breast  cancer  cells.  Our 
preliminary  studies  have  shown  that  targeting  of  this  transcription  factor  in  the  highly  aggressive, 
mesenchymal-like,  metastatic  human  breast  cancer  cells  should  lead  to  profound  reversal  of  the 
invasive  phenotype. 

So  What? 

These  results  together  suggest  that  a  small  molecule  therapeutics  strategy  targeting  Snail  or  the 
kinases  that  post-translationally  modify  these  proteins  and  other  biomolecules  are  critical  in  the 
regulation  of  EMT  and  metastasis  and  that  it  would  be  a  fruitful  strategy  to  move  forward  with.  This 
work  provides  the  critical  “proof-of-principle”  for  undertaking  these  future  studies. 
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A  method  for  in  silico  identification  of  SNAIL/SLUG  DNA  binding 
potentials  to  the  E-box  sequence  using  molecular  dynamics  and 
evolutionary  conserved  amino  acids 

JW  Prokop3’*,  Y  Liub,  A  Milsted3,  H  Pengb,  and  FJ  Rauscher  lllb 

aProgram  of  Integrated  Bioscience,  The  University  of  Akron,  Akron  OH  44325 

bThe  Wistar  Institute,  Philadelphia  PA  19104 

Abstract 

Binding  of  transcription  factors  to  DNA  is  a  dynamic  process  allowing  for  spatial-  and  sequence- 
specificity.  Many  methods  for  determination  of  DNA-protein  structures  do  not  allow  for 
identification  of  dynamics  of  the  search  process,  but  only  a  single  snap  shot  of  the  most  stable 
binding.  In  order  to  better  understand  dynamics  of  DNA  binding,  as  a  protein  encounters  its 
cognate  site,  we  have  created  a  computer  based  DNA  scanning  array  macro  which  sequentially 
inserts  high  affinity  DNA  consensus  binding  site  at  ah  possible  locations  in  a  predicted  protein- 
DNA  interface.  We  show  that  using  short  molecular  dynamic  simulations  at  each  location  in  the 
interface,  energy  minimized  states  and  decreased  movement  of  evolutionary  conserved  amino 
acids  can  be  readily  observed  and  used  to  predict  the  consensus  binding  site.  This  macro  is  applied 
to  SNAIL  class  C2H2  zinc  finger  family  proteins.  The  analysis  suggests  that  1)  SNAIL  binds  to 
the  E-box  in  multiple  states  during  encounter  with  its  cognate  site;  2)  several  different  amino  acids 
contribute  to  the  E-box  binding  in  each  state;  3)  the  linear  array  of  zinc  fingers  contributes 
differentially  to  overall  folding  and  base-pair  recognition,  and;  4)  each  finger  may  be  specialized 
for  stability  and  sequence  specificity.  Moreover,  the  macromolecular  movement  observed  using 
this  dynamic  approach  may  allow  the  NH2-terminal  finger  to  bind  without  sequence  specificity 
yet  result  in  higher  binding  energy.  This  macro  and  overall  approach  could  be  applicable  to  many 
evolutionary  conserved  transcription  factor  families  and  should  help  elucidate  better  the  varied 
mechanisms  used  for  DNA  sequence  specific  binding. 


Keywords 

SNAIL;  zinc  finger  recognition;  E-box  binding;  transcription  factor  binding;  protein-DNA 
dynamics 


Introduction 

How  eukaryotic  transcription  factors  find  their  cognate  DNA  binding  sites  in  the  promoters/ 
enhancers  of  target  genes  is  critical  to  understanding  how  these  factors  regulated  gene 
expression  and  hence  cellular  phenotype.  This  function  is  defined  in  the  context  of 
chromatin  and  large  excess  of  non-specific  DNA  binding  sites  in  the  nucleus.  Further 
complicating  our  understand  is  the  transcription  factors’  expression  patterns  [1],  disorder  of 
the  protein  structure  [2],  and  the  complicated  interactions  each  protein  has  [3,4]  in  a  network 
which  may  influence  DNA  binding  specificity.  With  many  proteins  we  have  the  power  to 
predict  protein  structure  and  function  [5,6];  however,  less  work  has  been  done  to  use 
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computer  algorithms  to  predict  how  transcription  factors  finds  and  regulate  a  high  affinity 
recognition  site.  A  short  DNA  sequence  for  transcription  factor  binding  has  a  high 
probability  of  being  ubiquitous  in  the  genome  (with  hundreds  of  thousands  of  sites),  yet  only 
a  fraction  of  these  sites  are  occupied  by  the  transcription  factor.  Even  fewer  of  these 
occupied  sites  yield  regulatory  roles  [7].  This  suggests  that  binding  is  a  dynamic  process, 
and  that  recruitment  and  chromatin  state  (spatial  components)  play  a  large  role. 

Transcription  factors  have  the  ability  to  “bind  and  search”  in  a  scanning  mechanism  in  the 
nucleus/DNA  to  identify  DNA  sequence  and  the  proteins  exist  in  multiple  states  of  DNA 
interaction  [8-1 1].  These  all  point  to  a  very  dynamic  role  of  proteins  in  how  they  interact 
and  identify  their  consensus  sequence  which  is  undervalued  in  solid  state  or  solution 
structures  of  DNA-protein  complexes. 

In  this  study  we  have  employed  in  silico  biology  to  develop  a  new  DNA  scanning  array 
program  of  a  consensus  DNA  sequence  through  a  model  of  a  protein-DNA  structure  to 
identify  dynamic  binding  properties.  We  have  elected  to  use  the  zinc  finger  transcription 
factor  SNAIL  and  its  homologous  family  to  address  the  potential  mechanism  involved  in 
DNA  binding  to  the  E-box  sequence  CAGGTG  [12].  SNAIL  contains  four  canonical  zinc 
fingers,  each  independently  folded  into  a  helix  and  an  antiparallel  beta  sheet  when 
complexed  to  zinc.  SNAIL  contains  three  C2H2  domains,  which  have  two  cysteines  (C)  and 
two  histidines  (H)  that  directly  coordinate  zinc.  A  fourth  finger  is  an  atypical  C3H  domain. 
While  the  cognate  E-Box  site  bound  by  SNAIL  has  been  known  for  more  than  15  years, 
remarkably  very  little  is  understood  about  how  SNAIL  interacts  with  DNA,  what  specific 
amino  acids  give  it  sequence  specificity,  and  what  the  structure  of  the  complex  is.  Herein  we 
propose  the  first  molecular  model  for  SNAIL-E-box  binding  using  this  molecular  dynamics 
based  approach.  This  modeling  approach  will  allow  for  many  future  studies  at  the  bench  top 
to  be  performed  on  the  SNAIL-E-box  complex,  allowing  for  a  better  understanding  of  how 
SNAIL  is  involved  in  cancer  and  cancer  metastasis.  Although  SNAIL  was  chosen  for  this 
study,  the  approach  can  be  applied  to  many  other  transcription  factor  families  with  the  hopes 
of  elucidating  mechanisms  and  networks  of  transcriptional  activation  and  repression. 


Methods 

Modeling  of  the  SNAIL  protein  with  DNA 

The  model  of  SNAIL,  containing  4  Zn  fingers  bound  to  DNA,  was  created  using  PDB 
structures  ltf3  (TLIIIA  protein,  for  zinc  fingers  1  and  2  of  SNAIL)  and  2il3  (an  artificial  Zn 
finger,  for  Zn  fingers  3  and  4  of  SNAIL)  based  on  the  most  homologous  sequence  of  known 
zinc  fingers  in  the  pdb  to  each  one  of  the  four  zinc  fingers  of  SNAIL.  To  create  the 
homology  model  of  SNAIL,  fingers  1  and  2  of  ltf3  were  parsed  and  amino  acids  changed  to 
those  present  in  fingers  1  and  2  of  SNAIL  based  on  CLUSTAL  [13]  sequence  alignment. 
These  fingers  (from  ltf3)  were  then  aligned  to  the  2nd  and  3rd  fingers  of  the  structure  2il3 
using  Mustang  [14].  Zn  fingers  1,  2,  3  and  6  of  2il3  were  then  deleted.  The  remaining  Zn 
fingers  (4  and  5)  were  mutated  in  silico  to  the  amino  acids  matching  fingers  3  and  4  of 
SNAIL  as  determined  by  alignments.  A  bond  was  created  joining  the  2nd  finger’s  carboxyl 
terminus  (originally  from  structure  ltf3)  to  the  amino  terminus  of  finger  3  (from  2il3).  This 
resulted  in  a  structure  contain  4  fingers,  each  with  its  own  Zn  ion,  and  a  20  mer  DNA 
sequence  (the  original  recognition  sequence  from  2il3).  The  linker  domain  between  fingers 
1  and  2  was  corrected  for  SNAIL,  which  contains  a  slightly  shortened  linker  domain 
compared  to  most  Zn  fingers.  The  final  SNAIL  sequence  (Ligure  1),  including  the  deletion 
of  two  amino  acids  from  structure  ltf3,  was  energy  minimized  to  relieve  bond  stress.  All 
atoms  were  then  freed  and  energy  minimizations  performed  multiple  times  using  AMBER03 
force  field  [15]  with  0.997  g/mL  of  water.  ConSurf  analysis  [16]  for  SNAIL,  SNAIL  like, 
Slug  and  Scratch  proteins  was  done  using  ClustalW  [13]  alignment  with  default  settings  and 
Bayesian  method  with  the  T92  model  for  phylogenetic  analysis. 
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Creation  and  molecular  dynamics  simulations  of  the  DNA  scanning  array 

To  the  modeled  protein-DNA  structure  above  the  DNA  sequence  was  changed  four  times  to 
one  of  four  DNA  bases  (A,  T,  C,  or  G)  on  the  first  DNA  strand  of  the  structure,  with  the 
complement  base  placed  on  the  opposite  strand  using  the  “6bp  Scanning  Array'  macro 
(Supplemental  file  pages  1 1-68)  in  YASARA  [17],  so  that  all  the  DNA  of  one  strand  is  the 
same.  This  creates  a  background  with  all  possible  DNA  bases  at  each  site  (four 
backgrounds),  which  serves  as  a  relative  starting  point  for  stability  when  scanning  the 
consensus  binding  sequence.  The  macro  then  inserts  the  6  base  pair  consensus  sequence 
(CAGGTG,  used  in  this  study)  at  multiple  sites  on  each  DNA  strand  with  all  four  DNA  base 
backgrounds,  generating  four  YASARA  scene  files  for  each  location.  Changes  to  only  the 
first  lines  of  the  macro  are  required  for  changing  the  sequence  of  the  consensus  site,  to  allow 
for  the  use  with  other  proteins.  At  each  site  the  “md  and  analysis  for  scanning  array  macro 
(supplemental  file  pages  69-121)  was  run.  Details  of  the  conditions  for  all  molecular 
dynamics  simulations  can  be  found  in  the  macro  scripts.  The  macro  runs  a  500  picosecond 
md  simulation,  which  then  calculates  multiple  root  mean  squared  deviations  (RMSD).  The 
score  for  each  amino  acid  from  the  Consurf  analysis  (ConsR,  on  a  scale  of  1-9  with  9  being 
highly  conserved  and  1  having  no  conservation,  Figure  SI)  was  divided  by  the  averaged 
heavy  atom  RMSD  (movement  of  residue,  Mr)  of  each  amino  acid  in  the  four  DNA  base 
pair  backgrounds  at  each  location  of  the  consensus  (Figure  2  Equation  1)  yielding  Xrz 
(amino  acid  R’s  conserved  movement,  at  the  Z  position  of  the  consensus  sequence).  This 
quantity  was  then  averaged  for  all  the  amino  acids  in  SNAIL  (Figure  2  Equation  2)  and 
normalized  to  the  value  from  the  control  DNA  sequences  without  a  consensus  (Figure  2 
Equation  3)  yielding  the  ConSurfreiz  (The  Consurf  total  relative  to  the  Z  position  of  the 
consensus  sequence).  For  the  DNA  base  pair  movement,  each  of  the  bases’  RMSD  from 
consensus  sequence  (M^z)  was  divided  by  the  same  location  on  the  control  DNA  sequence 
(Moo)  to  give  the  relative  movement  (M^naz)  (Figure  2  Equation  4).  This  was  then 
averaged  for  the  six  bases  of  the  consensus  sequence  (Figure  2  Equation  5)  yielding 
DNAreiz.  The  compiled  movement  (Rz)  was  calculated  by  dividing  the  ConSurfreiz  by  the 
DNAreiz  (Figure  2  Equation  6)  and  the  positions  ±1  (Figure  2  Equation  7)  or  +2  (Figure  2 
Equation  8)  of  the  Rz  were  added  to  address  dynamic  binding  and  transitions.  The  models 
were  analyzed  (z-score)  using  YASARA2  force  field  and  model  quality  (normality  of 
dihedrals  and  packing). 


Results 


The  overall  approach  was  to  create  a  program  which  could  help  illuminate  the  dynamic 
mechanism(s)  by  which  a  transcription  factor  binds  a  well-established  DNA  consensus 
sequence  using  a  computational  approach  which  could  be  advanced  to  high  throughput 
methodology.  This  method  should  be  able  to  confirm  or  identify  the  energy  stable  binding 
state,  while  also  addressing  the  dynamic  energy  landscape  in  which  that  binding  is  directed. 
On  the  energy  minimized  structure  of  SNAIL,  Zn  finger  1  was  found  around  DNA  base 
pairs  C13-C16,  finger  2  at  C9-C12,  finger  3  at  C6-C8,  and  finger  4  at  C3-C5.  The 
combination  of  the  two  macros  used  to  study  the  Zn  finger  SNAIL  binding  to  the  E-box 
sequence  CAGGTG,  allowed  for  placement  of  the  consensus  sequence  at  each  location 
throughout  the  DNA  strand  and  studied  the  energetics  of  the  DNA-protein  interaction  at 
each  location,  with  the  consensus  sequence  (C  of  CAGGTG)  starting  at  Cl  (C1-C6),  C2 
(C2-C7),. . ..,  C15  (C15-C20)  or  on  the  D  strand  of  DNA.  Potential  energies  of  the  modeled 
SNAIL  protein  from  the  multiple  consensus  locations  showed  few  sites  (D32  and  D33)  to 
have  lower  values,  suggesting  most  of  the  SNAIL  structures  are  stabilized  and  similar  in  all 
consensus  locations  (Figure  S2A).  Binding  energy  of  SNAIL  to  DNA  were  not  altered  much 
for  the  multiple  locations,  with  the  highest  value  at  the  location  C3  (Figure  3).  The  relative 
conserved  movement  (Rz)  was  highest  at  the  C4  location  (1.12581),  with  a  high  value  also 
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found  at  C13  (1.114211,  Figure  4A).  A  value  of  1  represents  the  Rz  calculated  for  the  DNA 
which  did  not  contain  a  consensus  sequence.  The  value  of  Z  +  1  (which  allows  a  single 
metric  for  multiple  binding  locations  to  identify  energy  profiles  with  multiple  sites  in  close 
proximity  with  high  binding  energies)  enabled  identification  of  two  areas  of  probable 
binding,  with  the  consensus  sequence  starting  around  C3-C5  and  again  around  C 12  to  Cl 3 
(Figure  4B).  The  Z  +  2  values  are  similar  to  the  Z  +  1  values  (Figure  S2B-C),  and  therefore 
do  not  allow  further  identification  of  possible  sites.  Future  experiments  should  allow  for 
optimizing  the  Z  +  1  value  for  quick  identification  of  probable  binding  sites. 

To  address  the  specifics  of  why  these  two  locations  (C4  and  Cl 3)  are  highly  favored, 
individual  components  of  our  calculations  were  studied.  The  DNA  movement  (DNArejz)  of 
the  consensus  sequence,  which  shows  the  stability  of  the  consensus  base  pairs  in 
simulations,  was  lowest  when  the  consensus  started  at  both  C4  and  the  C 13  locations  (Figure 
S3),  suggesting  increased  stability  due  to  the  protein  environment  at  that  location.  Next,  the 
Xrz  scores  were  used  to  address  the  amino  acids  contacting  the  consensus  sequence  at  each 
of  the  two  highly  favored  sites.  The  score  was  averaged  for  all  of  the  consensus  locations  in 
the  array  for  each  amino  acid  (Figure  5A)  to  understand  the  intrinsic  variation  of  the  assay. 

A  difference  from  each  location  of  the  array  to  this  average  allows  for  identification  of 
potential  amino  acid  leading  to  specificity  of  binding  and  stability  of  the  DNA  base  pair 
movement.  For  the  consensus  found  with  the  E-box  sequence  at  the  C4  location,  the  largest 
variation  was  found  at  amino  acids  246  and  247  with  a  difference  greater  than  2  (Figure  5B). 
Two  times  the  standard  error  of  the  average  for  these  amino  acids  was  only  0.322957  and 
0.295983,  suggesting  a  strong  significance  of  the  difference  seen  at  the  C4  location  from  all 
the  other  locations.  Both  of  these  amino  acids  are  found  in  the  fourth  zinc  finger  with  a 
highly  conserved  Ser  at  246  and  an  Arg  at  247.  The  hydroxyl  group  on  the  Ser  likely 
hydrogen  bonds  with  the  phosphate  backbone,  putting  the  Arg  side  chain  in  the  perfect 
location  to  hydrogen  bond  with  the  first  or  second  G  of  the  consensus  sequence  (CAGGTG, 
Figure  6).  The  average  structures  for  each  of  the  backgrounds  with  the  C4  location  confirm 
the  proper  distance  is  maintained  in  all  simulations  (Figure  S4A).  Mutation  of  these  two 
amino  acids  significantly  altered  the  ability  to  shift  the  E-box  sequence  (Figure  S4B). 
Tracking  the  score  of  Ser  246  and  Arg  247  over  the  entire  consensus  locations  shows  that 
they  are  higher  at  locations  flanking  the  E-box  consensus  at  the  C4  location  (C1-C5)  and  no 
other  consensus  locations  (Figure  S4C). 

Besides  Ser  246  and  Arg  247  of  finger  4,  several  amino  acids  are  found  in  the  normal 
positions  for  DNA  specificity  of  a  zinc  finger.  These  amino  acids  include  Met  171  of  finger 
1,  Arg  191/Trp  193  of  finger  2,  Ser  221  of  finger  3,  and  Met  248/Ser  249  of  finger  4.  Met 
171,  Arg  191,  and  Met  248  have  no  increased  Xrz  score  for  any  of  the  consensus  locations 
(Figure  S5),  thus  suggesting  they  do  not  contribute  to  specificity  of  binding.  Trp  193  has  an 
increased  Xrz  score  around  the  E-box  sequence  at  the  C9  location  with  additional  elevated 
levels  for  C8,  Cl  1  and  Cl 2.  These  locations  put  the  E-box  between  fingers  2  and  3.  Ser  221 
had  an  elevated  Xrz  score  at  E-box  location  C2  and  D35  while  Ser  249  showed  elevated 
values  at  C14.  As  the  E-box  locations  C 12-03  yielded  the  second  highest  Z  +  1  score,  the 
difference  between  each  consensus  Xrz  score  and  the  control  Xrz  score  was  observed. 

None  of  the  amino  acids  around  finger  one  or  two  (amino  acids  156-202)  contributed  to  the 
elevation  of  the  Xrz  score  (Figure  S6).  However,  using  heavy  atom  RMSD  values  for  these 
consensus  locations  showed  that  the  linker  region  between  fingers  1  and  fingers  2  (amino 
acids  176-180)  were  lower  and  therefore  more  stable  in  DNA  interaction  with  consensus 
sites  placed  starting  at  C12,  C13,  and  C14  (Figure  S7,  this  linker  region  is  not  conserved  in 
the  SNAIL  like  and  Scratch  proteins  and  therefore  received  no  scoring  in  the  Xrz  scores 
based  on  ConSurf  results).  In  summary  it  appears  that  critical  contact  residues  for  sequence 
specificity  are  Ser  246  and  Arg  247,  with  additional  contributions  from  the  first  linker 
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domain  and  amino  acids  Trp  193,  Ser  221,  and  Ser  249.  This  dynamic  approach  taken  here 
suggests  a  multiple  state  binding  for  SNAIL. 

Discussion 

Here  we  introduce  a  program  which  should  be  useful  in  localizing  DNA-protein  contacts  in 
a  cognate  transcription  factor-DNA  complex.  Many  structure  determination  methods  are 
based  on  static,  non-biological  conditions,  yet  biology  functions  in  dynamic  processes  in 
aqueous  environments.  Thus,  new  approaches  are  needed  to  address  how  proteins  bind  to 
DNA.  Recent  evidence  has  supported  a  role  of  zinc  fingers  binding  and  shuffling  on  DNA  at 
a  nanosecond  timescale  in  an  asymmetrical  role11.  Many  solid  state  structures  lack  this 
dynamic  component,  thus  there  is  a  clear  need  for  tools  to  help  determine  these  dynamic 
processes.  In  silico  experiments  provide  a  low  cost  preliminary  data  generation  relative  to 
expensive  NMR  and  protein  purification,  allowing  scientists  to  screen  a  larger  dataset  of 
proteins  for  their  potential  role  in  these  dynamic  processes.  By  using  either  known  or 
modeled  structures  of  proteins  interacting  with  DNA,  it  is  possible  to  elucidate  dynamic 
binding  mechanisms. 

SNAIL  is  involved  in  the  epithelial-mesenchymal  transition  (EMT)  involved  in  breast  [18] 
and  prostate  [19]  cancers.  This  involvement  in  cancer  regulation  is  through  an  E-box 
sequence  with  a  consensus  of  six  base  pairs  (CAGGTG).  The  canonical  model  shows  that 
each  zinc  finger  domain  recognizes  three  DNA  base  pairs  [20] .  SNAIL  contains  four,  highly 
conserved  fingers,  suggesting  that  only  some  of  the  fingers  are  involved  in  recognition  of  the 
E-box  sequence.  Until  now  it  has  been  unclear  which  fingers  contact  which  bases  in  the  E- 
box  consensus.  We  employed  an  in  silico  approach,  creating  a  consensus  sequence  array 
using  molecular  dynamics  simulations  and  evolutionary  conservation  to  determine  a  likely 
mechanism  of  binding.  Our  initial  model  before  the  scanning  array  yielded  a  z-score  of 
-0.997  and  with  the  consensus  location  starting  at  C4  of  -0.891.  Z-score  values  between  0 
and  -2  are  considered  fair,  and  the  calculations  of  the  z-scores  for  the  original  structures  are 
-1.261  for  ltf3  and  -0.433  for  2il3.  Although  subtle  variations  in  the  structure  may  exist 
than  those  of  our  model  for  SNAIL,  it  provided  an  opportunity  to  test  the  macro  created  in 
this  paper  while  providing  significant  hypothesis  generation  that  can  be  tested  in  benchtop 
experiments  for  the  SNAIL  system.  Our  final  models  of  SNAIL  bound  to  DNA  provide 
similar  results  (potential  and  binding  energies)  to  other  Zn  fingers  complexed  to  DNA  and 
thus  provide  a  strong  starting  point  for  threading  the  E-box  consensus  sequence  through  the 
DNA  of  our  structure.  Further  comparisons  of  SNAIL  structure  to  other  Zn  finger  proteins 
binding  different  consensus  sequences  and  other  non-Zn  finger  proteins  binding  to  the  E-box 
sequence  will  be  detailed  in  subsequent  publications.  In  addition,  the  role  of  amino  acid  246 
and  247  have  been  confirmed  through  use  of  bench  top  methods. 

Although  details  of  this  method  are  still  being  actively  modified  for  use  with  other  proteins, 
the  length  of  md  simulation  (500ps)  in  this  manuscript  appears  to  allow  for  stability  of  the 
protein  in  simulation,  as  can  be  seen  in  the  representative  plots  for  both  energy  and  carbon 
alpha  RMSD  of  the  C4  location  (Figure  S8).  This  time  was  initially  determined  as  RMSD 
values  stabilized  around  100  picoseconds  of  SNAIL  interaction  with  random  DNA,  allowing 
for  full  400  picoseconds  of  stabilized  simulations.  This  amount  of  time  we  suggest  optimizes 
for  detection  of  stability  of  critical  amino  acids,  while  reducing  the  computational 
requirements  on  performing  hundreds  of  molecular  dynamics  simulations  for  longer  periods 
of  time.  Each  protein-DNA  complex  will  need  to  be  adjusted  and  validated  for  the  length  of 
simulations  to  allow  for  stability  of  the  complex. 

Results  from  this  novel  in  silico  approach  suggests  that  binding  likely  occurs  through  the 
fourth  finger  with  conserved  contacts  created  by  amino  acids  Ser  246  and  Arg  247. 
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Additional  contacts  on  fingers  two  and  three  maintain  a  shuffling  protein  around  this 
consensus  sequence.  As  this  small  shuffle  of  SNAIL  on  the  DNA  E-box  sequence  takes 
place,  finger  one  may  stabilize  and  bind  tighter  with  minimal  to  no  sequence  specificity, 
through  electrostatic  interaction  of  the  polar  basic  amino  acids  of  SNAIL  with  the  phosphate 
backbone  of  the  DNA.  As  the  linker  domain  between  fingers  one  and  two  is  smaller  and 
atypical  from  most  other  zinc  finger  proteins,  it  may  require  more  time  to  stabilize  with  the 
phosphate  backbone.  This  mechanism  would  allow  for  the  SNAIL  proteins  to  scan  the  DNA 
and  when  the  DNA  consensus  identified,  SNAIL  to  reduce  its  movement  to  a  shutter  around 
the  site  and  have  a  tighter  binding  through  stabilization  of  finger  one  (Figure  7).  Interactions 
of  this  linker  domain  with  14-3-3  [21]  may  alter  the  stability  and  binding  affinity  of  SNAIL 
to  E-box  sequence.  Similar  mechanisms  may  also  exist  to  that  previously  shown  for  Egr-1 
[11]  allowing  finger  one  of  SNAIL  to  bind  to  another  strand  of  DNA,  translocating  the  other 
fingers  to  another  DNA  strand. 

These  results,  in  this  study,  provide  the  first  molecular  model  for  SNAIL  class  zinc  finger 
protein  binding  to  the  E-box,  a  DNA  sequence  of  high  biological  and  disease  relevance.  This 
model  may  serve  as  a  tool  in  understanding  cancer  progression  and  metastasis,  allowing  for 
drug  design  to  specific  amino  acids.  The  data  generated  using  this  in  silico  DNA  scanning 
array  can  be  used  to  suggest  amino  acids  to  mutate  for  benchtop  analysis.  At  minimal  cost 
(excluding  time  to  run  simulations  and  building  computers)  to  perform,  this  allows  for  a 
screening  of  potential  mechanisms  that  are  conserved  in  a  family  of  transcription  factors. 
When  combined  with  mutagenesis,  DNA  gel  shift  assays,  NMR  and  other  molecular/ 
biochemical  techniques,  it  provides  a  strong  addition  to  our  understanding  of  protein-DNA 
interactions  in  a  dynamic,  rather  than  static,  process. 

Conclusions 

Use  of  this  in  silico  DNA  scanning  array  has  elucidated  a  potential  mechanism  for  SNAIL- 
E-box  sequence  specificity.  A  multiple  state  binding  mechanism  appears  likely,  in  which  the 
first  of  four  zinc  fingers  on  SNAIL  may  form  tight  complexes  when  fingers  2-4  complex  to 
the  E-box.  The  use  of  these  methods  can  be  applied  to  all  transcription  factor  families  with 
hopes  of  further  explaining  DNA  sequence  specificity. 

Supplementary  Material 

Refer  to  Web  version  on  PubMed  Central  for  supplementary  material. 
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Finger  1  Finger  2 

KAFN  CKYCNKEYLSLGALKMHIRSH  TLPCV  CGTCGKAFSRPWLLQGHVRTH  TGEKPFS 
156  '  176  182  202 


Finger  3  Finger  4 

CPHCSRAFADRSNLRAHLQTH  SDVKKYQ  CQACARTFSRMSLLHKHQESGC  S 

210  230  238  259 

Figure  1.  Sequence  of  SNAIL  zinc  fingers 

Sequence  used  of  SNAIL  to  make  model  with  the  four  zinc  fingers  highlighted  in  yellow 
and  the  amino  acids  interacting  with  zinc  shown  in  red.  Number  is  based  on  human  SNAIL. 
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Figure  2.  Math  equations  used  in  the  DNA  scanning  array  analysis 

ConsR  =  conservation  score  of  each  residue  (based  on  ConSurf  results),  Mr  =  heavy  atom 
RMSD  of  each  residue,  Xrz  =  conserved  movement  for  amino  acid  R  with  consensus  at  the 
Z  location.  Xrq  =  conserved  movement  for  amino  acid  R  with  no  consensus  sequence, 
ConSurfrejz  =  relative  conserved  movement  of  the  protein  with  the  consensus  at  the  Z 
location,  M^z  =  movement  of  one  of  the  bases  of  the  consensus  site  of  the  DNA,  = 
movement  of  the  base  Mp)z  in  the  background  with  no  consensus  sequence,  Mdnaz  = 
relative  movement  of  the  consensus  DNA  sequence.  DNAreiz  =  average  movement  relative 
to  the  background  of  the  consensus  sequence  at  each  position  (Z),  Rz  =  score  for  conserved 
movement  of  the  protein  and  the  DNA  at  position  Z  of  the  consensus  sequence,  Z+l(2)  = 
additive  scores  for  positions  plus  or  minus  one  site  (or  2)  around  the  Z  site  of  the  consensus 
location. 
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Figure  3.  Binding  energies  of  SNAIL  at  each  location  of  the  consensus  DNA  or  the  control 

Binding  energy  of  SNAIL  to  DNA  from  the  energy  minimized  structure  for  each 
background  at  various  locations  of  the  two  strands  (C1-C15  and  D21-D35)  as  determined  in 
YASARA  with  AMBER03  force  field. 
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Figure  4.  Calculated  conserved  movement  scores 

A)  The  relative  conserved  movement  (Rz)  for  each  site  of  the  consensus  on  the  C  DNA 
strand  (left)  or  the  D  strand  (right).  B)  Adding  the  Rz  to  the  consensus  shifted  to  the  left  or 
right  by  one  shows  two  highly  conserved  favorable  binding  sites  on  the  C  strand  (left)  while 
none  on  the  D  (strand).  A  value  of  1  for  A  or  3  for  B  would  be  the  score  of  the  control. 
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Figure  5.  Xrz  score  for  each  amino  acid 

A)  Average  Xrz  for  each  amino  acid  over  all  locations  of  the  DNA  consensus  sequence 
with  the  error  bars  representing  plus  or  minus  the  standard  error  times  two  of  the  30 
locations  of  the  consensus  as  well  as  the  no  consensus  control.  B)  Xrz  score  for  the  C4 
location  of  the  consensus  minus  the  average  for  each  amino  acid  shown  in  A.  The  largest 
differences  were  seen  at  amino  acids  246  and  247  (orange). 
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Figure  6.  C4  and  C13  consensus  locations  on  the  DNA 

Top  shows  the  DNA  in  ball  and  sticks  while  the  bottom  shows  the  DNA  molecular  surface. 
Highlighted  in  the  box  is  the  location  of  the  Arg  247  interacting  with  the  DNA  consensus 
sequence  of  the  C4  location  while  Ser  246  interacts  with  the  phosphate  backbone  allowing 
for  stability  of  Arg  247. 
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Figure  7.  Potential  DNA  binding  mechanism  of  SNAIL  family 

1)  Fingers  2-3  of  SNAIL  form  weak  complex  with  DNA  with  no  sequence  specificity.  2) 
SNAIL  moves  on  the  DNA.  3)  SNAIL  binds  to  the  E-box  sequence  and  stabilizes.  4) 
Decreased  dynamics  allows  finger  1  to  stabilize  and  tightly  bind  the  E-box  sequence.  5) 
With  finger  one  unbound,  it  may  be  able  to  translocate  SNAIL  to  another  DNA  strand 
similar  to  mechanisms  seen  in  Egr-1. 
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Abstract 

Proteins  that  communicate  signals  from  the  cytoskeleton  to  the  nucleus  are  prime  targets  for  effectors  of 
metastasis  as  they  often  transduce  signals  regulating  adhesion,  motility,  and  invasiveness.  LIM  domain  proteins 
shuttle  between  the  cytoplasm  and  the  nucleus,  and  bind  to  partners  in  both  compartments,  often  coupling 
changes  in  gene  expression  to  extracellular  cues.  In  this  work,  we  characterize  LIMD2,  a  mechanistically 
undefined  LIM-only  protein  originally  found  to  be  overexpressed  in  metastatic  lesions  but  absent  in  the  matched 
primary  tumor.  LIMD2  levels  in  fresh  and  archival  tumors  positively  correlate  with  cell  motility,  metastatic 
potential,  and  grade,  including  bladder,  melanoma,  breast,  and  thyroid  tumors.  LIMD2  directly  contributes  to 
these  cellular  phenotypes  as  shown  by  overexpression,  knockdown,  and  reconstitution  experiments  in  cell  culture 
models.  The  solution  structure  of  LIMD2  that  was  determined  using  nuclear  magnetic  resonance  revealed  a 
classic  LIM-domain  structure  that  was  highly  related  to  LIM1  of  PINCH1,  a  core  component  of  the  integrin-linked 
kinase -parvin-pinch  complex.  Structural  and  biochemical  analyses  revealed  that  LIMD2  bound  directly  to  the 
kinase  domain  of  integrin-linked  kinase  (ILK)  near  the  active  site  and  strongly  activated  ILK  kinase  activity.  Cells 
that  were  null  for  ILK  failed  to  respond  to  the  induction  of  invasion  by  LIMD2.  This  strongly  suggests  that  LIMD2 
potentiates  its  biologic  effects  through  direct  interactions  with  ILK,  a  signal  transduction  pathway  firmly  linked  to 
cell  motility  and  invasion.  In  summary,  LIMD2  is  a  new  component  of  the  signal  transduction  cascade  that  links 
integrin-mediated  signaling  to  cell  motility/metastatic  behavior  and  may  be  a  promising  target  for  controlling 
tumor  spread.  Cancer  Res;  74(5);  1390-403.  ©2014  AACR. 


Introduction 

Defining  the  complex  biology  and  the  cascade  of  events  that 
lead  to  metastatic  spread  of  primary  tumors,  both  locally  and  to 
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distant  sites,  continue  to  be  major  unmet  needs  in  cancer  biology 
(1).  Moreover,  defining  which  molecular  events  in  both  the 
metastatic  cascade  and  in  the  maintenance  of  tumor  dormancy 
are  targetable  for  therapeutic  or  preventative  benefit  is  an  even 
more  daunting  task.  These  results  have  defined  molecules  that 
control  a  large  array  of  cellular  phenotypes,  including  cell 
motility,  cell-cell  and  cell-matrix  interactions,  and  immune 
evasion  (2,  3).  Generally,  it  seems  that  rare  metastatic  variants 
appear  stochastically  in  a  genetically  heterogeneous  primary 
tumor,  occurring  quite  early  in  tumor  progression,  and  that 
normal  developmental  processes,  such  as  epithelial-mesenchy¬ 
mal  transition  (EMT),  mesenchymal-epithelial  transition  (MET), 
and  angiogenic  cascades,  are  often  ectopically  activated  to 
achieve  tumor  spread  (1, 4, 5).  Both  forward  genetic  and  descrip¬ 
tive  experimental  approaches  have  been  utilized  to  identify 
genetic  and  epigenetic  determinants  for  metastatic  capability, 
largely  by  selection  and  analysis  of  metastatic  variants  in  popu¬ 
lations,  or  by  comparing  the  expression  and  mutation  profiles  of 
matched  primary  and  metastatic  lesions  using  the  vast  spectrum 
of  "-omic"  technologies  currently  popular  (6-8).  The  finding  of 
metastasis-associated  antigens  and  transcriptional  and/or 
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LIMD2,  a  Tumor  Progression  Effector,  Regulates  Cell  Motility 


Figure  1.  A,  LIMD2  and  the  LIM- 
only  protein  family.  B,  LIMD2  is 
most  closely  related  to  CRP1 .  C, 
the  PINCH1-LIM1  and  LIMD2  LIM 
domains  are  homologous.  The 
zinc-chelating  residues  are 
highlighted  in  red;  the  conserved 
amino  acids  are  highlighted  in 
gray.  D,  antibodies  robustly  detect 
LIMD2  protein.  *,  nonspecific 
binding.  E,  TPC1  cells  were 
transfected  with  myc-LIMD2  then 
fixed  and  stained  with  both  anti- 
myc  tag  antibody  (red)  or  anti- 
LIMD2  mAb  (green).  The  cells  were 
counterstained  with  DAPI  to 
highlight  the  nucleus  (blue).  The 
red  arrows  indicate  concentrations 
of  Myc-  and  LIMD2  costaining 
signal,  which  is  at  the  leading  edge 
of  the  cells  and  also  is  present  in 
streaks  reminiscent  of  vinculin- 
containing  adhesion  plaques. 
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genetic  signatures  in  these  comparisons  has  been  a  useful 
exercise,  and  may  be  useful  in  the  clinic,  but  these  approaches 
often  fail  to  distinguish  drivers  of  the  metastatic  phenotype  from 
passenger/ markers  of  the  phenotype  and,  moreover,  rarely  led  to 
specific  mechanisms.  In  this  study,  we  characterized  the  LIMD2 
protein,  which  originally  identified  as  highly  and  exclusively 
overexpressed  in  metastatic  lesions  but  absent  in  matched 
normal  tissue  or  primary  tumor  (9). 

LIMD2  is  a  LIM-only  domain  protein  that  was  identified  as  a 
biomarker  for  papillary  thyroid  carcinoma  (PTC)  lymph  node 
metastasis  (LNM)  from  molecular  profiling  of  matched  sam¬ 
ples  (9).  LIMD2  was  found  to  be  highly  expressed  in  LNM  but 
absent  from  the  primary  tumor  or  normal  thyroid  tissue  in 
matched  patient  PTC  samples,  suggesting  that  LIMD2  expres¬ 
sion  could  provide  an  improved  means  of  detecting  potentially 
metastatic  PTC  cells  during  initial  staging  of  a  newly  diagnosed 
carcinoma.  In  the  human  genome,  there  are  135  identifiable 
LIM-encoding  sequences  located  within  58  genes.  The  LIM 
domain  is  organized  as  a  tandem  zinc-finger  structure  that 


functions  as  a  modular  protein-binding  interface  (Fig.  1A).  LIM 
domain-containing  proteins  have  diverse  cellular  roles  such  as 
regulators  of  gene  expression,  cyto-architecture,  cell  adhesion, 
cell  motility,  and  signal  transduction.  LIM  domain  proteins  are 
emerging  as  key  molecules  in  a  wide  variety  of  human  cancers 
(10).  In  particular,  all  members  of  the  human  LIM  domain-only 
(LMO)  proteins,  LMOl  to  LM04,  which  are  required  for  many 
normal  developmental  processes,  are  implicated  in  the  onset 
or  progression  of  several  cancers,  including  T-cell  leukemia, 
breast  cancer,  and  neuroblastoma.  Here,  we  report  that  LIMD2 
regulates  cell  motility  and  is  a  novel  effector  of  tumor  pro¬ 
gression  via  its  role  in  the  integrin-linked  kinase  (ILK)  pathway. 

Materials  and  Methods 

Plasmids,  antibodies,  cell  culture,  transfection, 
migration  assays,  soft-agar  assays,  GST-binding  assays, 
immunofluorescence,  and  immunohistochemistry 

The  plasmids  pET-28a(+)-LIMD2,  pGEX-4Tl-LIMD2, 
pcDNA3. 1  -Flag-LIMD2,  pcDNA3.1-Flag-LIMDl,  and  pcDNA3.1- 
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Flag-AJUBA  were  constructed  through  PCR-based  cloning 
(cloning  details  available  on  request).  Sport6-LIMD2  (human) 
and  pLK0.1-shLIMD2  (human)  plasmids  were  purchased  from 
Open  Biosystems.  pET-32-TT-ILK,  1-174  (His),  and  pCOLA- 
Duet-PINCHl-LIMl  (GST)  plasmids  are  kindly  provided  by 
David  Calderwood  and  Titus  Boggon  (11).  Preparation  of  the 
His-LIMD2  and  GST-LIMD2  fusion  proteins  were  performed  as 
described  previously  (12)  and  yielded  soluble  protein  purified 
under  native  conditions.  The  DNA  region  encoding  human 
LIMD2  amino  acids  1  to  127  was  isolated  via  PCR  with 
synthetic  oligonucleotides  containing  5'  Bam  HI  and  3'  Hindlll 
and  the  digested  DNA  fragment  cloned  into  pQE30  vector  at 
the  corresponding  restriction  sites.  This  produced  a  protein 
that  encoded  the  N-terminal  vector-encoded  peptide  of 
MRGSHHHHHHGS-.  Followed  by  AA1-127  of  hLIMD2  followed 
by  a  stop  codon.  This  was  used  to  produce  rabbit  polyclonal 
antibody  using  standard  techniques.  For  monoclonal  antibody, 
the  DNA  region  encoding  LIMD2  (AA1-127)  was  isolated  via 
PCR  with  synthetic  oligonucleotides  containing  5'  EcoBl  and  3' 
Xhol,  and  the  digested  DNA  fragment  cloned  into  pGEX4T-l 
vector  at  the  corresponding  restriction  sites.  This  produced  a 
protein  that  encoded  the  N-terminal  vector-encoded  GST, 
followed  by  AA1-127  of  hLIMD2  followed  by  a  stop  codon. 
The  mAb  used  in  this  study  was  given  the  Wistar  designation 
"LIMD2  clone  #18.30"  and  was  immunoglobulin  G  (IgG)  iso- 
type.  Both  sources  of  crude  antibodies  were  affinity  purified 
using  either  the  T-Gel  or  the  SulfoLink  Kits  (Pierce)  using  full- 
length,  native  LIMD2  protein  coupled  to  the  resin. 

HEK293,  U20S,  MCF-7,  MDA-MB-231,  MDA-MB-435,  and 
mouse  fibroblast  (ILK+/+  and  ILK_/_)  cells  were  maintained 
in  Dulbecco's  modified  Eagle  medium  (DMEM)  containing 
10%  FBS,  2  mmol/L  L-glutamine,  and  penicillin  (50  units/ 
mL)/streptomycin  (50  pg/mL;  P/S)  at  37°  C  under  5%  C02  in 
a  humidified  chamber.  MCF-10A  mammary  epithelial  cells 
were  maintained  in  DMEM:Ham's  F12  containing  5%  house 
serum,  epidermal  growth  factor  (10  ng/mL),  hydrocortisone 
(0.5  pg/mL),  cholera  toxin  (100  ng/mL),  insulin  (10  pg/mL), 
and  P/S.  Melanoma  cell  lines  (Foml96-l,  RGP-WM35,  RGP- 
WM3211,  VGR-WM793,  and  Metl205  Lu)  were  maintained  in 
Tu  2%  medium  containing  80%  MCDB153,  20%  Leibovitz's  L- 
15,  2%  FBS,  insulin  (bovine;  5  pg/mL),  and  1.68  mmol/L 
CaCl2.  Bladder  cancer  cell  lines  (T24  and  RT4)  were  main¬ 
tained  in  McCoys  5A  containing  10%  FBS,  1.5  mmol/L  l- 
glutamine,  2.2  g/L  sodium  bicarbonate,  and  P/S.  Bladder 
cancer  cell  lines  (HT1376,  UM-UC3,  ScaBer,  J82,  and 
TCCSUP)  were  cultured  in  Eagle's  Minimum  Essential  Medi¬ 
um  containing  10%  FBS,  2  mmol/L  L-glutamine,  1.5  g/L 
sodium  bicarbonate,  and  P/S.  Thyroid  cancer  cell  lines 
(XTC-1,  FTC133,  FTC236,  FTC238,  and  TPC-1)  were  main¬ 
tained  in  DMEM  containing  10%  FBS,  thyroid-stimulating 
hormone  (TSH;  bovine;  0.01  units/mL),  insulin  (human;  10 
pg/mL),  P/S,  and  amphotericin  B  (250  ng/mL).  Transient 
transfection  in  HEK293  and  U20S  cells  using  lipofectamine 
were  performed  as  described  (13).  Short  hairpin  RNA 
(shRNA)-mediated  knockdown  using  lentiviral  vectors  was 
performed  as  described.  The  siRNAs  directed  against  human 
LIMD2  were  obtained  from  Open  Biosystems.  The  infected 
cells  were  grown  in  puromycin  selection  medium  for  1  week, 


and  the  pool  was  used  for  Western  blotting  and  for  func¬ 
tional  assays.  Whole  cell  extracts  were  prepared  for  Western 
blotting. 

Cell  migration  assays  were  performed  using  standard  trans¬ 
well  chambers  (8-pm  pore  size;  Corning).  The  cells  were  serum 
starved  for  4  hours  before  seeding  for  the  transwell  assay.  After 
detachment  with  trypsin,  the  cells  were  washed  gently  with 
PBS  and  resuspended  in  serum-free  medium.  A  250  pL  cell 
suspension  (2  x  105  cells/mL)  was  added  to  the  upper  cham¬ 
ber.  Triplicate  wells  were  done  for  each  assay.  The  complete 
medium  was  added  to  the  bottom  chamber.  After  overnight 
incubation,  cells  on  the  filters  were  fixed  (glutaraldehyde)  and 
stained  with  0.5%  toluidine  blue.  The  number  of  migrated  cells 
on  each  filter  was  quantified  manually  by  counting  all  cells  in  4 
separate  fields  under  x  10  magnification  using  a  microscope. 
The  mean  number  of  cells  in  triplicate  filters  was  calculated 
and  is  reported  in  each  bar  of  the  graph,  ±SD  for  1  experiment. 
Each  experiment  was  repeated  at  least  twice.  T  tests  were 
performed  and  P  values  were  obtained  to  compare  the  different 
cell  lines  used. 

For  immunofluorescence,  TPC1  cells  grown  on  sterile  cov- 
erslips  were  transfected  with  pcDNA3.1  myc-LIMD2;  24  hours 
later  the  media  was  aspirated  off  and  the  cells  were  rinsed  2 
times  in  1  x  PBS.  The  cells  were  fixed  with  4%  formaldehyde 
for  15  minutes  at  room  temperature  (RT).  Then  the  cells  were 
rinsed  3  times  with  1  x  PBS  for  5  minutes  each.  The  cells  were 
permeated  with  0.2%  Triton  X-100  for  5  minutes.  The  cells 
were  rinsed  3  times  with  1  x  PBS  for  5  minutes  each.  The  cells 
were  blocked  for  1  hour  in  5%  goat  serum.  The  cells  were  then 
incubated  overnight  at  4°C  with  the  following  dilutions  of 
primary  antibodies  (made  in  5%  goat  serum):  1:1,000  LIMD2 
(mAb),  1:200  Myc-tag  antibody  (Cell  Signaling  Inc.).  The  cells 
were  rinsed  3  times  with  1  x  PBS.  Then  the  cells  were  incu¬ 
bated  with  1:1,000  dilution  of  secondary  anti-mouse  Alexa 
Fluor  594  and  anti-rabbit  Alexa  Fluor  647  antibodies  (diluted 
in  5%  goat  serum)  for  30  minutes  at  RT  (in  the  dark).  The  cells 
were  rinsed  3  times  with  1  x  PBS  for  5  minutes  each.  The  cells 
were  counterstained  with  DAPI  for  2  minutes  and  rinsed  3 
times  in  lx  PBS  for  5  minutes  each.  The  coverslip  was 
mounted  on  a  glass  slide  with  a  drop  of  Prolong  Gold  anti¬ 
fade  reagent  and  observed  by  confocal  microscopy. 

The  soft  agar  assay  was  performed  as  described  by  the 
Wallert  and  Provost  Lab.  The  base  agar  is  composed  of  0.5% 
agar,  lx  DMEM,  and  10%  FBS.  The  top  agar  is  formed  by 
mixing  equal  volume  of  2  x  DMEM,  20%  FBS,  0.7%  agar,  and  the 
cell  suspension  to  give  1  x  DMEM,  10%  FBS,  0.35%  agar,  and  2  x 
104  or  4  x  104  cells/well  (6-well  plate).  The  plates  were 
incubated  at  37°  C  in  humidified  incubator  for  2  to  3  weeks. 
The  cells  were  fed  2  times  per  week  with  cell  culture  media.  The 
plates  were  stained  with  0.5  mL  of  0.005%  Crystal  Violet  for  1 
hour  and  the  colonies  in  each  well  were  quantified  by  counting 
4  fields  under  2x  magnification  using  a  microscope.  Each 
condition  was  done  in  triplicate  and  the  average  number  of 
colonies  per  well  is  represented.  Each  experiment  was  repeated 
3  times. 

The  GST  association  assays  using  in  vitro  transcribed  and 
translated  (IVT)  protein  was  performed  as  described  (12).  Full- 
length,  35S-labeled  ILK  protein  was  used  for  binding  to  the 
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bacterially  produced  GST  fusion  protein,  and  the  association 
and  washing  buffer  was  1  x  radioimmunoprecipitation  assay 
buffer.  Standard  IHC  techniques  were  used  to  stain  paraffin- 
embedded  tissues,  except  that  sections  were  subjected  to 
antigen  retrieval  by  pressure  boiling  in  10  mmol/L  sodium 
citrate,  0.05%  Tween  20,  pH  6.0,  for  3  minutes  at  120°  C.  The 
sections  were  blocked  with  1%  normal  horse  serum  for  30 
minutes,  incubated  with  aLIMD2  (mAb)  overnight  at  4°C 
(1:100  dilution  of  a  1  mg/mL  IgG  stock)  and  followed  by  30 
minutes  incubation  with  horse  anti-mouse  biotinylated  immu¬ 
noglobulin  (1:200).  Subsequent  steps  followed  the  Vectastain 
Elite  ABC  Kit.  The  staining  intensity  and  percentage  of  positive 
cells  were  scored  using  an  ordinal  scale  blinded  to  the  clinical 
and  pathologic  data.  Staining  intensity  score  was  0  for  absent,  1 
for  faint,  2  for  moderate,  and  3  for  strong.  The  percentage  of 
positive  cells  score  was  0  for  0%,  1  for  less  than  33%,  2  for  >33% 
and  <66%,  and  3  for  >66%. 

Nuclear  magnetic  resonance  spectroscopy 

The  hLIMD2  cDNA  was  cloned  into  pMAL-c5TEV  (modified 
version  of  pMAL-c5X  from  NEB)  and  expressed  in  Escherichia 
coli  BL21  (DE3).  The  cells  were  grown  at  37°  C  until  OD600  0.7  and 
then  induced  with  0.5  mmol/L  of  isopropylthio-P-galactoside  at 
25°  C  for  20  hours.  The  harvested  cells  were  suspended  in  lysis 
buffer  [20  mmol/L  Tris  pH  7.4,  200  mmol/L  NaCl,  100  qmol/L 
ZnS04,  0.2%  IGEPAL,  complete  protease  inhibitor  (1  tablet  per 
liter  of  the  culture),  0.1  mg/mL  lysozyme,  0.5  mmol/L  phenyl- 
methylsulfonyl  fluoride,  and  10  mmol/L  p-mercaptoethanol]. 
The  cells  were  disrupted  by  sonication,  lysate  was  cleared  by 
centrifugation  at  18,000  rpm,  and  applied  to  amylose  resin 
(NEB)  equilibrated  with  the  equilibrate  buffer  (20  mmol/L  Tris 
pH  7.4,  200  mmol/L  NaCl,  100  qmol/L  ZnS04,  and  10  mmol/L 
p-mercaptoethanol)  at  4°C  for  1  hour.  The  resin  was  washed  5 
times  with  20  mL  of  equilibration  buffer.  MBP-fusion  protein 
was  eluted  with  10  mmol/L  maltose  and  cleaved  by  100  pg  of 
TEV  protease  per  liter  of  the  culture  at  16°  C  overnight.  The 
cleaved  protein  was  subsequently  purified  by  MonoS  4.6/100  PE 
and  HiLoad  16/60  Superdex  75  pg  [pre-equilibrated  with  50 
mmol/L  phosphate  pH  7.2,  100  mmol/L  NaCl,  and  100  pmol/L 
TCEP  (Amersham  Biosciences)].  Isotopically  enriched  LIMD2 
was  prepared  from  cells  grown  on  minimal  media  (supplemen¬ 
ted  with  100  pmol/L  of  ZnS04)  containing  1  g/L  of  [15N] 
ammonium  chloride  with  2  g/L  of  either  unlabeled  glucose  or 
[13C6]  glucose  (Cambridge  Isotopes  Laboratory).  The  ankyrin 
repeat  domain  (ARD;  l-174aa),  kinase  domains  (182-452aa)  of 
ILK  were  cloned  into  pET  32a  (Invitrogen)  and  pGEX-6Pl  (GE), 
respectively.  Growth  and  induction  were  carried  out  as 
described  above,  except  that  the  expression  temperature  was 
reduced  to  15°  C.  The  lysis  buffer  contained  50  mmol/L  phos¬ 
phate  pH  7.2, 200  mmol/L  NaCl,  10  mmol/L  p-mercaptoethanol, 
and  10%  glycerol.  The  protein  used  for  nuclear  magnetic 
resonance  (NMR)  was  characterized  by  analytical  ultracentri¬ 
fugation  and  circular  dichroism  to  determine  its  suitability  for 
NMR. 

Protein  concentrations  of  LIMD2  (300-400  pmol/L)  were 
used  for  NMR  experiments  for  resonance  assignment  and 
structure  calculation.  The  NMR  spectra  for  assignment  were 
collected  in  90%  H2O/10%  D20  containing  50  mmol/L  sodium 


phosphate,  pH  7.2,  100  mmol/L  NaCl,  and  1  mmol/L  TCEP  at 
600  MHz  on  a  Varian  INOVA  spectrometer  equipped  with  an 
HCN  cold  probe  at  25°  C.  Backbone  assignment  was  carried  out 
using  HNCACB  and  CBCA(CO)NH,  whereas  side  chain  assign¬ 
ments  were  determined  using  HCCH-TOCSY,  CC(CO)NH, 
HBHA(CO)NH,  as  well  as  2D  (HB)CB(CGCD)  HD  and  (HB) 
CB(CGCDCE)HE.  The  protonation  state  of  the  histidine  resi¬ 
dues  was  examined  using  a  modified  HMQC  experiment  (14). 
Distance  restraints  were  derived  from  15N-edited  NOESY  in 
90%  H2O/10%  D20,  and  13C-edited  NOESY  and  2D  NOESY  in 
D20.  The  spectra  were  processed  by  NMRPipe  (15)  and  ana¬ 
lyzed  by  SPARKY  (16).  The  molecular  dynamics  based  model  of 
LIMD2-ILK-Parvin  was  obtained  by  AutoDOCK  (17)  analysis 
of  fragmented  LIMD2  structure  against  that  of  the  pdb  struc¬ 
ture  3kmw  (ILK/a-Parvin),  rethreading  the  LIMD2  structure 
through  the  top  2  binding  sites  of  the  Autodock  analysis  and 
energy  minimizations  performed  in  water  at  a  pH  of  7.4. 

The  ILK  kinase  assay  was  performed  as  described  previously 
(18)  with  modifications.  ILK  and  LIMD2  were  mixed  and 
preincubated  for  30  minutes  at  room  temperature  before 
addition  of  the  substrate,  myosin  phosphatase  target  subunit 
1  (MYPT1;  Millipore).  The  amounts  of  ILK  and  MYPT1  were 
kept  constant  at  33.3  nmol/L  (50  ng/20  pL  reaction)  and  417 
nmol/L  (500  ng/20  pL  reaction),  respectively,  and  increasing 
amounts  of  LIMD2  were  added  as  indicated.  Kinase  reactions 
were  initiated  by  the  addition  of  ATP  to  a  final  concentration  of 
500  nmol/L.  Reactions  were  carried  out  in  1  x  kinase  reaction 
buffer  (25  mmol/L  Tris-HCl  pH  7.5,  5  mmol/L  (3-glyceropho- 
sphate,  2  mmol/L  dithiothreitol,  0.1  mmol/L  Na3V04, 10  mmol/ 
L  MgCl2)  for  30  minutes  at  30°  C,  terminated  with  the  addition 
of  sample  buffer  and  proteins  separated  by  SDS-PAGE.  ILK  and 
P-MYPT  were  assessed  by  Western  blot  analysis.  MYPT  con¬ 
centration  was  assessed  by  Coomassie  blue  staining. 

Results 

Characterization  of  LIMD2  protein  and  production  of 
aLIMD2  antibodies 

LIMD2  belongs  to  the  LIM-only  family  of  LIM  proteins, 
which  contain  between  1  and  5  LIM  domains  (Fig.  1A).  A 
distance-based  phylogenetic  analysis  of  amino  acid  sequences 
among  these  LIM-only  protein  family  members  was  performed 
using  MEGA  program  (Fig.  IB).  The  LIMD2,  CRP1,  and  LMOl 
segregate  from  FHL1  and  PINCH1,  indicating  that  they  are 
more  similar  to  one  another  than  they  are  to  FHL1  and  PINCH, 
which  are  more  closely  related,  implying  that  there  are  shared 
ancestral  relationships  between  the  individual  LIM  domains 
(Fig.  1C;  ref.  19).  To  define  the  targets  and  functions  of  LIMD2, 
rabbit  polyclonal  (rAb)  and  mouse  monoclonal  antibodies 
(mAb;  Supplementary  Fig.  SI  A)  were  produced  and  their 
specificity  was  determined.  Both  aLIMD2  (rAb)  and  aLIMD2 
(mAb)  detect  the  exogenous  and  endogenous  LIMD2  protein  in 
HEK293  and  U20S  cell  lines  (Fig.  ID)  but  do  not  interact  with 
AJUBA  andLIMDl  (Supplementary  Fig.  S1B-S1D).  By  transient 
transfection,  the  Myc-LIMD2  protein  [detected  by  both  Myc 
(rAb)  and  LIMD2  (mAb)]  is  found  predominantly  in  cytoplasm 
in  a  smooth  granular  distribution  (Fig.  IE),  a  result  confirmed 
by  subcellular  fractionation  (data  not  shown).  However, 
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LIMD2  is  concentrated  in  membrane  ruffles  and  in  streaks 
reminiscent  of  focal  adhesion  plaques  (Fig.  IE).  Interestingly, 
the  protein  is  not  excluded  from  the  nucleus,  consistent  with 
many  other  reports  of  nucleocytoplasmic  shuttling  by  LIM 
proteins.  LIMD2  is  robustly  detected  in  formalin  fixed,  paraf¬ 
fin-embedded  tissue  specimens  as  a  cytoplasmic  protein,  and 
no  signal  was  seen  in  cells  that  lack  LIMD2  (Supplementary 
Fig.  S1E).  Thus,  these  antibodies  are  specific  and  will  be  useful 
in  examining  the  role  of  LIMD2  in  cell  motility  and  tumor 
metastasis. 

LIMD2  transcript  and  protein  levels  correlate  with 
malignant  potential  in  breast,  bladder,  melanoma,  and 
thyroid  cancer  cell  lines  and  tumors 

LIMD2  levels  are  well  correlated  with  the  higher  degree  of 
invasiveness  among  immortalized  mammary  epithelial  cells 
(MCF-10A),  nonmetastatic  breast  cancer  cells  (MCF-7),  and 
breast  cancer  cells  (MDA-MB-231  and  MDA-MB-435;  Fig.  2A). 
The  LIMD2  RNA  levels  measured  by  quantitative  PCR  (qPCR) 


largely  correlate  with  the  protein  levels  (Supplementary  Fig. 
S2A).  MCF-7  cells  expressing  exogenous  LIMD2  were  obtained 
using  lentivirus  infection:  this  LIMD2  protein  was  readily 
detected  by  Western  blot  analysis  (Fig.  2B,  top).  LIMD2  expres¬ 
sion  stimulated  the  migration  of  the  MCF-7  cells  compared 
with  the  control  cell  line  (pLU  vector  control;  Fig.  2B,  bottom). 
Conversely,  the  knockdown  of  LIMD2  in  MDA-MB-231  cells 
strongly  impaired  migration  in  these  highly  aggressive  breast 
cancer  cells  (Fig.  2C).  Restoration  ofwtLIMD2  expression  in  Si- 
5  knockdown  MDA-MB-231  cells  (using  a  LIMD2  cDNA  with 
silent  mutations  in  the  coding  sequence  that  render  it  resistant 
to  the  LIMD2-Si  RNA)  resulted  in  a  regaining  of  the  same 
migration  ability  as  parental  cells  (Supplementary  Fig.  S3A  and 
S3B),  suggesting  that  the  effects  are  because  of  LIMD2  expres¬ 
sion  and  not  to  off-target  effects.  Depletion  of  LIMD2  strongly 
diminished  colony  formation  in  soft  agar  for  all  cell  lines,  but 
was  efficiently  restored  by  complementation  by  wtLIMD2 
expression  (Fig.  2E).  Major  differences  in  cell  morphology 
between  parental,  control  shRNA,  LIMD2-Si2,  and  LIMD2-Si5 
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Figure  2.  A,  breast  cancer  cell  lines 
express  endogenous  LIMD2.  B, 
ectopic  expression  of  LIMD2  in 
MCF-7  cells.  C,  siRNA-mediated 
LIMD2  knockdown  in  MDA-MB- 
231  cells  (the  Si2  and  Si5  are  2 
independent  cell  clones). 

Migration  potentials  of  each  cell 
line.  Data  are  presented  as  mean  ± 
SD.  T  test  was  performed  and  P 
value  is  indicated.  D,  morphology 
of  the  cells.  Bar,  100  |nm.  E,  soft 
agar  colony  formation  assays  of 
each  cell  line. 
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Figure  3.  A,  LIMD2  expression  in  a 
spectrum  of  melanoma  cancer 
cells  by  Western  blot  analysis.  B, 
knockdown  of  LIMD2.  C,  migration 
potentials  of  melanoma  cells.  The 
statistics  were  performed  and  P 
value  is  indicated.  D,  morphologic 
changes  in  melanoma  cancer 
cells.  Bars, 100  |um. 
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cells  were  also  observed:  the  vector  cells  exhibited  a  spindle¬ 
like  fibroblastic  morphology,  whereas  the  LIMD2-Si2  and 
LIMD2-Si5  cells  showed  an  enlarged  and  flattened  morphology 
(Fig.  2D).  LIMD2  knockdown  cells  are  enlarged  and  flattened  as 
indicated  with  white  arrows  (Supplementary  Figs.  S3C  and  S4). 
LIMD2  overexpression  restored  the  parental  cells  phenotype, 
showing  a  spindle-like  fibroblastic  morphology  as  indicated 
with  black  arrows  (Supplementary  Figs.  S3C  and  S4).  Thus, 
LIMD2  controls  multiple  hallmarks  of  oncogenic  growth: 
anchorage-independent  growth,  migration,  and  altered  cell 
morphology. 

LIMD2  was  analyzed  in  a  series  of  melanoma  [Fom  196-1 
cells  (normal  melanocytes),  RGP-WM35,  RGP-WM3211  cells 
(radial  growth-phase  melanoma),  VGP-WM793  cells  (vertical 


growth-phase  melanoma),  and  Metl205  Lu  cells  (metastatic 
melanoma);  Supplementary  Fig.  S2B].  LIMD2  protein  was 
undetectable  in  normal  melanocyte  Foml96-l  cells,  whereas 
the  expression  significantly  increased  in  the  RGP,  VGP,  and 
metastatic  melanoma  cells  in  a  somewhat  graded  fashion 
(Fig.  3A).  Following  introduction  of  knockdown  siRNA  len- 
tiviral  vectors  for  LIMD2,  protein  levels  and  the  migration 
rates  were  significantly  reduced  in  all  melanoma  cell  lines 
(Fig.  3B  and  C),  whereas  the  cell  morphology  was  widely 
variable  (Fig.  3D).  The  RGP  WM3211  cells  with  LIMD2 
knockdown  were  flat  compared  with  the  spindle-shaped 
parental  cells  (Fig.  3D,  RGP  WM3211  panels).  The  metastatic 
melanoma  cells  (Metl205  Lu)  morphology  was  changed 
dramatically  showing  a  highly  elongated  shape  reminiscent 
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of  neuronal  cells,  whereas  the  LIMD2-Si5  cells  became  flat 
and  larger.  Loss  of  LIMD2  significantly  reduced  Metl205  Lu 
cell  migration  (Fig.  3C). 

In  the  series  of  bladder  cancer  cells  [RT4,  HT1376,  UM-UC3, 
ScaBer,  J82,  TCCSUP,  and  T24  (listed  from  least  to  most 
aggressive)],  LIMD2  levels  coordinately  increased  with  the 
known  malignant  characteristics  of  these  cells  (Supplementary 
Figs.  S2C  and  S6A),  with  the  more  aggressive  cell  lines  showing 
higher  migration  rates  (Supplementary  Fig.  S6B  and  S6C). 
Ectopic  LIMD2  expression  in  the  more  benign  bladder  cancer 
cell  line  RT4,  which  showed  undetectable  endogenous  protein 
expression  levels  significantly  increased  migration  potential 
(Supplementary  Fig.  S6D),  indicating  that  the  gain-of-function 
LIMD2  promotes  bladder  cancer  cell  migration.  LIMD2  knock¬ 
down  in  T24  cells  efficiently  reduced  LIMD2  protein  levels  to 
almost  undetectable  levels  (Supplementary  Fig.  S6E),  and  these 
cells  showed  significant  decreased  numbers  of  colony  forma¬ 
tion  in  soft  agar  compared  with  the  parental  and  nontargeted 
siRNA  cells. 

Because  LIMD2  was  identified  from  a  papillary  thyroid 
cancer  metastasis,  we  evaluated  LIMD2  expression  in  thyroid 
carcinoma  cells  lines:  XTC-1,  FTC133,  FTC236,  FTC238,  and 
TPC-1.  LIMD2  levels  are  elevated  in  the  thyroid  papillary  cell 
line  TPC-1,  which  is  the  most  aggressive  thyroid  cancer  cell 
line  (Fig.  4A  and  Supplementary  Fig.  S2D)  and  shows  the 
highest  migration  rate  (Fig.  4B  and  C).  We  also  evaluated 
LIMD2  expression  in  primary  thyroid  cancer  specimens  using 
qPCR.  A  total  of  252  papillary  thyroid  cancer  samples  were 
evaluated  for  LIMD2  mRNA  level  by  qPCR.  We  found  that  the 
LIMD2  mRNA  level  was  significantly  higher  in  papillary  thyroid 
cancer  with  extrathyroidal  invasion  (P  =  0.001;  Fig.  4D  and  E). 
This  highly  significant  finding  further  underscores  the  role  of 
LIMD2  in  invasion  and  metastasis  processes. 

Structural  studies  into  LIMD2  give  insights  into 
molecular  function 

By  analogy  to  the  known  functions  of  other  LIM  domains, 
LIMD2  likely  binds  to  cellular  proteins.  We  sought  a  structural 
biology  solution  to  identify  its  targets  by  taking  advantage  of 
the  fact  that  LIMD2  is  well  folded,  globular,  monomeric,  and 
monodisperse  in  solution  at  high  concentrations,  making  it 
suitable  for  NMR.  Complete  backbone  assignments  of  the 
protein  were  determined  (Fig.  5A),  however  analysis  of  NOESY 
spectra  (13C-  and  15N-edited  NOESY-HSQC  as  well  as  2D- 
NOESY)  indicated  a  lack  of  signals  for  the  first  35  and  last 
23  amino  acids  of  the  protein  consistent  with  these  being 
unstructured.  Consequently,  only  residues  33  to  104  were  used 
for  structure  determination.  This  protein  is  composed  of  two 
treble  clef  fingers  (19, 20)  folded  around  two  zinc  atoms  (Fig.  5B 
and  C).  ICP-EOS  studies  also  indicate  that  there  are  2.4  zinc 
molecules  per  LIMD2  domain.  The  zinc-ligated  residues  are 
Cys40,  Cys43,  His61,  and  Cys64  (CCHC)  for  site  I,  and  Cys67, 
Cys70,  Cys88,  and  His91  (CCCH)  for  site  II.  The  first  finger  is 
composed  of  2  antiparallel  (3-sheets  with  zinc-coordinated 
residues  located  on  the  loop  connecting  two  strands  in  each 
antiparallel  (3- sheet.  The  second  finger  is  composed  of  a  long 
loop  followed  by  an  antiparallel  (3- sheet  and  ends  in  an 
approximately  2-turn  helix.  The  zinc-ligating  residues  in  the 


second  site  reside  on  the  long  loop  and  the  beginning  of  a  2- 
turn  helix  (Fig.  5B).  The  N51  atoms  of  the  2  histidines  (His61  and 
His91)  coordinate  the  zinc.  The  zinc-ligating  residues  comprise 
one  of  the  key  structural  elements  of  LIMD2.  In  addition, 
structural  analysis  indicates  that  there  is  a  small  hydrophobic 
core  likely  involved  in  the  stabilization  of  the  protein,  including 
the  aromatic  side  chains  of  residues  Phe60,  Phe65,  and  Phe86, 
which  form  a  hydrophobic  cluster  between  the  2  zinc  fingers 
and  are  buried  (Fig.  5E). 

We  assessed  the  structural  similarity  between  LIMD2  and 
other  reported  LIM  structures  present  in  the  PDB.  The 
superposition  of  LIMD2  onto  the  3-dimensional  crystal  struc¬ 
tures  of  PINCH1-LIM1  (PDB  code  3F6Q;  ref.  11),  and  PINCH2- 
LIM1  (PDB  code  3IXE;  refs.  21  and  22)  yielded  an  RMSD  of 
2.9  and  2.8  A,  respectively,  for  the  backbone  of  the  LIM 
consensus  sequence  (residues  39-96  in  LIMD2).  The  second¬ 
ary  structure  elements  are  well  conserved,  although  the 
antiparallel  (3-sheet  of  the  second  finger  of  the  LIMD2  adopts 
a  flat  conformation,  and  the  c'  strand  rotates  around  90° 
along  the  axis  of  the  sheet  compared  with  other  structures 
(Fig.  5C  and  D).  The  electrostatic  surface  of  LIMD2  is  very 
positive,  as  expected,  given  its  p/  of  8.7;  however,  the  charge 
distribution  is  quite  asymmetric,  with  both  positive  and 
hydrophobic  patches  (Fig.  5F).  Comparison  of  LIMD2  to 
PINCH-LIM1  reveals  that  although  the  latter  has  a  net  nega¬ 
tive  charge  (p/  of  4.9),  once  again  the  charge  distribution  is 
asymmetric  and  contains  hydrophobic  patches  (Supplemen¬ 
tary  Fig.  S5).  Similar  to  LIMD2,  PINCH-LIM4  is  a  positively 
charged  domain  (p/8.7);  however,  in  contrast  to  LIMD2,  the 
C-terminal  helix  is  both  negatively  and  positively  charged 
on  the  corresponding  faces  based  on  alignment  with  the 
LIMD2  domain,  respectively.  The  asymmetric  charge  distri¬ 
bution  and  positions  of  hydrophobic  patches  may  supply 
various  interfaces  to  accommodate  specific  binding  proteins. 

The  LIMD2  structure  was  highly  similar  to  the  first  LIM 
domain  (LIM1)  of  PINCH1.  PINCH1-LIM1  binds  to  the  ARD  of 
the  ILK  (11).  ILK  is  a  452  amino  acid,  multidomain  protein 
consisting  of  5  N-terminal  ankyrin  repeats,  and  a  C-terminal 
kinase  domain  (Figs.  6A  and  8B).  ILK  is  a  core  component  of  the 
IPP  complex,  which  contains  PINCH  proteins  and  Parvin 
(Fig.  8B).  The  IPP  complex  binds  to  the  C-terminal  tails  of 
(3l-integrins,  thus  serving  to  communicate  cell-cell  and  cell- 
matrix  adhesion  signals  to  the  nucleus  (23).  To  determine  if 
PINCH1-LIM1  and  LIMD2  bound  to  ILK,  we  used  GST-LIMD2, 
GST-PINCH1-LIM1  and  35S-labeled  ILK  protein  produced  by 
IVT.  Indeed,  the  GST-LIMD2  protein  robustly  binds  to  ILK 
under  the  same  conditions  as  PINCH-LIM1;  neither  protein 
binds  to  GST  alone  (Fig.  6B).  Identical  results  were  observed  for 
ILK  and  LIMD2  interaction  using  fully  recombinant  proteins, 
isolated  from  E.  coli  (Fig.  6C).  These  studies  strongly  suggest 
that  ILK  is  a  candidate  target  protein  for  binding  by  LIMD2. 
GST-PINCH1-LIM1  bound  to  the  ARD  robustly,  as  has  been 
shown  by  other  investigators  (11),  but  the  LIMD2  protein  did 
not  bind  to  the  ILK- ARD  (data  not  shown).  Moreover,  LIMD2 
did  not  compete  for  PINCH  1 -LIM  1  for  binding  to  ILK  (data  not 
shown). 

To  determine  if  endogenous  cellular  ILK  is  required  for 
stimulation  of  migration  by  LIMD2,  we  used  a  matched  set  of 
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Figure  4.  A,  LIMD2  is  highly 
expressed  in  thyroid  cancer  cells. 
B,  migration  potentials  of  the 
thyroid  cancer  cells.  Data  are 
presented  as  mean  ±  SD.  C, 
images  of  thyroid  carcinoma  cell 
line  XTC-1  and  thyroid  papillary 
carcinoma  cell  line  TPC-1  taken 
from  the  migration  assay.  Bar,  1 00 
|Lim.  D,  LIMD2  mRNA  is  higher  in 
thyroid  cancer  samples  with 
extrathyroidal  invasion 
(n  =  49).  Total  RNA  was  extracted 
from  252  papillary  thyroid  cancer 
samples  using  the  TRIzol  reagent 
(Invitrogen).  TaqMan  gene 
expression  assay  (Applied 
Biosystems)  was  used  to 
quantitate  LIMD2  mRNA 
expression  levels.  E, 
representative  immunostaining 
results  in  papillary  thyroid  cancer 
samples  using  the  aLIMD2  (mAb). 
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fibroblasts  that  are  either  wild  type  for  ILK  (+/ +)  or  have  been 
rendered  ILK  null  (— /— )  via  CRE-mediated  excision  of  floxed 
ILK  alleles  (24).  Both  of  these  cell  types  grow  robustly  in 
culture,  show  similar  morphologies,  similar  doubling  times, 
and  are  equally  infectable  by  lentiviruses  (Fig.  6D).  The  —  /— 


cells  lack  ILK  by  Western  blot  analysis  as  expected  (Fig.  6E). 
LIMD2  is  robustly,  and  equally  expressed  in  both  cell  types  (Fig. 
6E)  when  infected  with  lentiviruses.  LIMD2  expression  in 
ILK+/+  cells  stimulates  cell  migration  almost  4-fold.  However, 
there  is  no  stimulation  of  migration  by  LIMD2  in  cells  lacking 
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Figures.  A,  1H-15N  HSQC 
spectrum  of  LIMD2  with  backbone 
assignment:  BMRB  accession 
number  18778.  Purple  represents 
aliased  signals.  B,  superimposition 
of  the  1 0  best  structures  calculated 
by  ARIA  (PDB  accession  code 
2LZU)  with  an  RMSD  of  0.32  A  for 
LIMD2  residues  39-96aa  over 
backbone  atoms.  See 
Supplementary  Table  SI  for 
structure  statistics.  Zinc  is  colored 
in  gold  and  side  chains  of  the  zinc- 
coordinated  residues  are  shown  in 
green.  C,  ribbon  representation  of 
LIMD2.  D,  alignment  of  LIMD2  (red) 
with  PINCH1-LIM1  (cyan  PDB 
code  3F6Q),  PINCH2-LIM1  (green 
PDB  code  3IXE),  and  PINCH1- 
LIM4  (orange  PDB  code  1NYP).  E, 
side  chains  of  F60,  F65,  and  F86 
from  a  LIMD2  hydrophobic  core.  F, 
electrostatic  surface  of  LIMD2.  The 
asymmetric  charge  distribution  in 
LIMD2  is  shown. 


the  — /—  cells  that  lack  ILK  (Fig.  6F).  Thus,  ILK  is  likely  a  key 
effector  for  LIMD2  stimulated  cell  migration. 

Because  LIMD2  bound  tightly  to  the  region  of  ILK  containing 
the  kinase  domain,  we  determined  if  LIMD2  could  alter  the 
kinase  activity  of  ILK  toward  a  model  peptide  substrate  in  vitro . 
Although  there  is  considerable  controversy  in  the  literature 
about  whether  ILK  is  a  kinase  or  a  pseudo-kinase,  the  obser¬ 
vation  that  baculovirus-expressed  and  purified  protein  contains 
this  activity  warranted  a  look  at  this  question:  we  engaged  the 
investigators  who  have  championed  this  claim  to  perform  these 
experiments.  Baculovirus-produced  full-length  ILK  (Fig.  7A) 
showed  considerable  kinase  activity  toward  the  MYPT  peptide 
as  has  previously  been  shown  (18).  The  addition  of  LIMD2  to  the 
in  vitro  assay  resulted  in  a  dose-dependent  increase  in  kinase 


activity,  showing  almost  a  3-fold  stimulation  at  the  highest 
levels  of  LIMD2  added  (Fig.  7B-D).  This  apparent  activation  of 
ILK  by  LIMD2  was  abolished  when  the  ILK  small-molecule 
inhibitor  kinase  inhibitor  QLT0267  was  added  to  the  reaction 
(Fig.  7E).  Thus,  binding  of  ILK  by  LIMD2  in  vitro  results  in 
stimulation  of  the  kinase  activity  exhibited  by  the  recombinant 
ILK  under  these  conditions.  Together,  these  data  strongly 
suggest  that  LIMD2  is  a  new  positive  regulator  of  ILK  activity. 

Discussion 

Characterization  of  the  novel,  metastasis-associated  LIMD2 
protein  described  herein  supports  the  following  conclusions: 
(i)  LIMD2  is  a  small  cytoplasmic  protein  whose  endogenous 
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Figure  6.  LIMD2  and  PINCH1 
directly  associate  with  ILK,  LIMD2 
affects  cell  migration  through  ILK. 
A,  domain  architecture  of  ILK.  The 
N-terminus  of  ILK  contains  ankyrin 
repeat  domain  (ARD)  and  the  C- 
terminus  of  ILK  contains  a  kinase 
domain.  B,  ILK  protein  binds  to 
GST-LIMD2  and  GST-PINCH1- 
LIM1  but  not  GST  alone.  C,  LIMD2 
protein  binds  to  GST-ILK  (182- 
452aa).  D,  LIMD2  stimulates  cell 
migration  in  ILK+/+  fibroblast  but 
not  in  ILK-/~  fibroblast.  Lentivirus 
pLU  vector  (GFP)  and  pLU-LIMD2 
were  infected  on  ILK+/+  and  ILK  7 
fibroblast  cells.  Migration  assay 
was  performed  on  pLU  and  LIMD2 
virus  infected  ILK+/+  and  ILK_/_ 
fibroblast  cells.  Images  were  taken 
from  the  migrated  cells.  Bar,  100 
|iim.  E,  endogenous  ILK  protein 
were  detected  in  ILK+/+  fibroblast 
cells  but  not  in  ILK  7  cells  by 
Western  blot  analysis  using  aILK 
(rAb).  Exogenous  LIMD2  proteins 
were  detected  in  LIMD2  virus- 
infected  fibroblast  cells  by 
Western  blot  analysis  using 
aLIMD2  (mAb).  F,  ectopic  LIMD2 
expression  promotes  ILK+/+ 
fibroblast  cells  migration.  Three 
independent  experiments  were 
performed.  Data,  mean  ±  SD. 

P  value  is  indicated. 
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expression  levels  correlate  with  the  known  malignant  proper¬ 
ties  of  cells  derived  from  breast,  melanoma,  bladder,  and 
thyroid  cancers;  (ii)  ectopic  expression  or  knockdown  of 
LIMD2  stimulates  or  abrogates  cell  invasiveness,  motility,  and 
cell  morphology  consistent  with  tumor  grade;  (iii)  primary 
human  thyroid  cancers  that  express  LIMD2  show  much  greater 
extrathymic  invasion;  (iv)  the  LIMD2  structure  is  homologous 
to  the  PINCH1-LIM1  domain  that  binds  to  ILK;  (v)  both 
PINCH  1-LIM1  and  LIMD2  proteins  bind  to  ILK  directly  but 
at  nonoverlapping  sites;  (vi)  LIMD2  stimulates  the  kinase 
activity  of  ILK,  and  endogenous  ILK  is  required  for  the  ability 
of  LIMD2  to  stimulate  cell  migration.  Thus,  in  LIMD2,  we  have 
discovered  a  new  target  and  function  for  a  LIM-only  protein  in 


tumor  progression  using  a  combination  of  tumor  biology  and 
structural  biology  experimental  approaches. 

The  finding  of  a  direct  connection  between  LIMD2  and  the 
integrin  signaling  apparatus  is  wholly  consistent  with  the 
promotion  of  cell  migration  and  invasion  seen  in  LIMD2- 
expressing  cell  lines  and  tumors.  The  multiprotein  IPP  com¬ 
plex  (ILK-PINCH-Parvin)  as  a  core  complex  links  transmem¬ 
brane  integrin  adhesion  receptors  to  the  actin  cyto skeleton 
and  intracellular  signaling  cascades  (11,  25).  The  ILK  localizes 
to  sites  of  integrin-mediated  cell  adhesion  (26,  25,  27).  The 
presence  of  intrinsic  catalytic  activity  as  a  kinase  in  ILK  is  still 
controversial,  but  the  domain  clearly  binds  to  the  CH2  domain 
of  Parvin  (28).  Formation  of  the  tripartite  IPP  complex  is 
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Figure  7.  LIMD2  increases  ILK 
kinase  activity  in  a  dose-dependent 
manner.  A,  coomassie  blue- 
stained  polyacrylamide  gel 
showing  highly  purified  ILK-GST 
fusion  protein  used  in  the  ILK 
kinase  assay.  B,  dose-dependent 
phosphorylation  of  the  ILK 
substrate  myosin  phosphatase 
target  subunit  1  (MYPT1)onThr696 
in  the  presence  of  increasing 
concentrations  of  purified  LIMD2. 
ILK  and  LIMD2  were  mixed  and 
preincubated  for  30  minutes  at 
room  temperature  before  addition 
of  the  substrate.  The  amounts  of 
I LK  and  M  YPT 1  were  kept  constant 
at  50  and  500  ng/reaction, 
respectively,  and  increasing 
amounts  of  LIMD2  were  added  as 
indicated.  C,  concentration  curve 
similar  to  that  shown  in  B,  with 
LIMD2  concentrations  increased 
stepwise  from  5  to  50  ng. 
Reactions  were  carried  out  as 
described  for  B.  D,  concentration 
curve  similar  to  that  shown  in  C, 
with  LIMD2  concentrations 
increased  stepwise  from  1  to  10 
and  20  ng  (molar  ratio  of  0.1-1 .9). 
Reactions  were  carried  out  as 
described  for  B.  Data  in  the  bar 
graph  are  normalized  to  the  signal 
observed  in  the  presence  of  ILK 
alone.  E,  effect  of  the  specific  small 
molecule  inhibitor  of  ILK,  QLT0267, 
on  the  phosphorylation  of  M  YPT  by 
ILK-LIMD2.  ILK-LIMD2  complexes 
were  preincubated  at  the  indicated 
concentrations  as  described 
above,  and  QLT0267  was  added  at 
a  final  concentration  of  1 .0  (imol/L 
just  before  initiation  of  the  kinase 
reaction.  Data  in  the  bar  graph  are 
normalized  to  the  signal  observed 
in  the  presence  of  ILK  alone. 


absolutely  required  for  integrin  signaling,  migration,  and  inva¬ 
sion  in  both  normal  developmental  and  malignant  programs. 
The  PINCH1  LIM1  domain  binds  directly  to  the  ARDs  of  ILK 
and  stimulates  integrin  signaling  (whereas  the  highly  related 
PINCH2  protein  binds  to  the  ILK  ARD  and  competes  for 
PINCH1  binding;  Figs.  22  and  29-31).  Other  proteins  that  bind 
to  the  core  IPP  complex,  such  as  Nek,  ERK1/2,  GSK3p,  and 
Rsul,  serve  to  remodel  the  IPP  complex  to  respond  to  different 
growth  factor  signaling  inputs  emanating  from  the  cell  mem¬ 
brane  (32)  and  largely  control  cell  growth  and  differentiation 
pathways,  most  notably  EMT.  In  addition  to  the  direct  effects 
of  LIMD2  on  ILK  kinase  function  as  described  below,  LIMD2 
analogously  may  adapt  the  IPP  complex  to  other  as  yet 
undefined  inputs. 

The  finding  that  LIMD2  binds  to  ILK  and  promotes  cell 
migration  and  invasion  is,  of  course,  inconsistent  with  it  being 
a  competitor  for  PINCH1 LIM1  binding,  even  though  they  share 
high  structural  similarity.  That  LIMD2  binds  to  ILK  at  a  site 
distinct  from  PINCH  1  is  consistent  with  a  model  that  LIMD2 


may  be  a  new  positive  regulator  of  the  IPP  complex  (Fig.  8B).  In 
fact,  LIMD2  and  PINCH  1  likely  bind  simultaneously  to  ILK  at 
widely  dispersed  sites.  At  first  blush,  it  may  seem  surprising 
that  LIMD2  and  PINCH1  would  target  separate  regions  of  ILK; 
however,  given  the  vastly  different  charge  distributions  on  the 
same  structural  scaffold  (Fig.  5F),  it  is  reasonable  that  these 
would  bind  different  surfaces  (Supplementary  Fig.  S5).  In 
contrast,  our  data  are  consistent  with  binding  of  LIMD2  near 
the  active  site  and  thus  directly  influencing  its  enzyme  activity. 
An  in  silico  molecular  dynamics-based  docking  analysis 
strongly  suggests  that  LIMD2  binds  to  a  site  in  ILK  very  near 
the  potential  active  site  but  separate  from  the  binding  site  for 
a-Parvin  (Fig.  8A).  It  is  noteworthy  that  when  ILK  is  coex¬ 
pressed  with  a-Parvin  in  bacteria,  the  complex  is  enzymatically 
dead  and  in  fact  a-Parvin  is  an  inhibitor  of  kinase  activity  when 
added  to  in  vitro  ILK  kinase  assays.  Thus,  a-Parvin  and  LIMD2 
may  serve  as  negative  and  positive  regulators  (respectively)  of 
kinase  activity  by  binding  the  2  different  lobes  of  the  kinase 
domain,  which  border  the  active  site  cleft  (Fig.  8A).  The  model 
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Figure  8.  A,  computer  molecular  modeling  of  the  LIMD2-ILK-aParvin 
protein  complex.  B,  a  proposed  working  model  for  LIMD2  functional  roles 
in  assembly  of  and  normal  function  of  the  ILK-PINCH-Parvin  complex  in 
the  integrin-linked  signaling  pathway. 

also  suggests  that  LIMD2  forms  a  well-defined  pocket/groove 
bordered  by  the  2  zinc-binding  fingers,  which  directly  interact 
with  an  active  site  loop  in  ILK:  this  pocket  may  be  a  promising 
target  for  small  molecule-based  inhibitor  design. 

Like  LIMD2,  almost  all  components  of  the  IPP  complex  have 
been  observed  to  be  deregulated  in  human  cancer  (32)  and  are 
largely  correlated  with  the  later  metastatic  phenotype.  PINCH1 
expression  is  highly  correlated  with  tumor  grade  and  invasive¬ 
ness  in  gliomas,  gastric  cancer,  and  squamous  cell  carcinoma, 
suggesting  that  LIMD2  overexpression  may  phenocopy  PINCH1 
overexpression  (33).  It  may  be  that  optimal  activation  of  ILK 
requires  binding  of  both  PINCH1  and  LIMD2.  The  negative 
regulator  of  IPP,  PINCH2,  is  often  silenced  by  methylation  in 
tumors.  Moreover,  the  PINCH1  LIM  domain  5  binds  directly  to 
an  altered  form  of  the  Ras  suppressor,  Rsul,  thereby  directly 
participating  in  Ras-mediated  transformation  (33).  Thus,  IPP 
deregulation  is  a  common  denominator  among  tumors  arising 
from  different  lineages,  which  supports  the  observations  that 
LIMD2  expression  occurs  in  multiple  tumor  types. 

Does  LIMD2  function  as  a  classic  "adaptor"  protein  as  has 
been  demonstrated  in  other  LIM-only  proteins?  One  of  the  best 
examples  of  this  function  is  the  LIM-only  protein  LM02,  which 


uses  its  2  LIM  domains  to  bind  the  GATA1  and  TALI  tran¬ 
scription  factors  in  the  nucleus  (34, 35).  In  this  model,  each  LIM 
domain  in  LM02  is  specific  for  each  partner.  Similar  specificity 
for  protein-protein  interactions  are  demonstrated  by  the  indi¬ 
vidual  LIM  domains  in  Paxillin,  TRIP6,  Ajuba,  and  of  course 
PINCH1,  whose  5  LIM  domains  are  likely  specialized  for 
different  partners.  Thus,  LIM  domain  proteins  with  2  or  more 
LIM  domains  function  as  true  scaffolds  to  combinatorially 
assemble  multiprotein  complexes  (10).  The  case  of  LIMD2  is 
problematic  to  propose  as  a  classic  scaffold  in  that  it  encodes 
only  a  single  LIM  domain.  However,  the  unstructured  NH2-  and 
COOH-terminal  protein  sequence  in  LIMD2  may  provide  this 
function.  The  pronounced  charge  differences  observed  on  each 
"face"  of  the  LIMD2  LIM  domain  may  indicate  that  the  LIM 
domain  itself  can  provide  separate  surfaces  for  protein-pro¬ 
tein  interactions.  It  is  formally  possible  that  LIMD2  functions 
as  a  positive  regulator  of  the  IPP  complex  by  simply  using  its 
single  LIM  domain  to  compete  for  a  negative  regulator  of 
ILK.  LIMD2  may  also  directly  modulate  the  activity  of  the 
kinase  domain  of  ILK.  It  is  noteworthy  that  LIMD2  has  not 
been  observed  in  multiple  proteomics-based  approaches  to 
the  composition  of  the  ILK  complex  and  the  proteins,  which 
inhabit  focal  adhesion  plaques  (36).  This  is  surprising  as  our 
preliminary  analyses  of  LIMD2  distribution  in  cells  by  con- 
focal  microscopy  show  it  to  partially  overlap  with  focal 
adhesion  streaks,  and  the  fact  that  LIMD2  seems  to  bind 
ILK  directly  and  with  considerable  affinity  using  recombi¬ 
nant  proteins.  An  ongoing  dynamic  approach  using  video¬ 
microscopy  of  genetically  tagged  fluorescent  proteins  should 
address  this. 

Does  LIMD2  carry  out  multiple  independent  functions  in 
different  subcellular  compartments?  Many  multi-LIM 
domain  proteins  shuttle  between  the  cytoplasm  and  nucle¬ 
us.  These  studies  were  initiated  by  the  pioneering  work  of 
Beckerle  showing  that  the  LIM  domain  protein  paxillin 
rapidly  shuttled  between  the  cytoplasm  and  the  nucleus 
and  had  key  regulatory  roles  in  both  compartments  (19,  37). 
LIMD2  may  function  like  its  closest  evolutionary  homologue, 
CRP1.  CRP1  is  a  2  LIM  protein  that  shuttles  between  the 
cytoplasm  and  nucleus.  In  the  nucleus  it  binds  a  complex  of 
the  SRF  and  GATA  transcription  factors,  whereas  in  the 
cytoplasm,  CRP1  binds  to  a-actinin  and  zyxin  to  reorganize 
the  cytoskeleton  (38).  The  Ajuba  LIM  domain  protein  illus¬ 
trates  a  similar  pathway  highly  relevant  to  cancer  progres¬ 
sion.  Our  laboratory  has  shown  that  the  3  LIM  domain 
protein  Ajuba  binds  to  the  SNAG  repression  domain  in  the 
snail/slug  transcription  factors  and  plays  a  key  role  in  the 
EMT  program  during  tumor  progression  by  repressing 
E-cadherin  and  other  genes  (39,  40).  However,  when  in  the 
cytoplasm,  Ajuba  is  found  at  cell  adhesion  junctions  and 
binds  to  CTNNA,  F-actin,  and  GRB2.  It  is  noteworthy  that 
PINCH1  can  be  found  in  both  the  cytoplasm  and  the  nucleus, 
and  this  location  has  prognostic  value  in  primary  human 
tumors  (33). 

In  summary,  the  study  of  LIMD2  as  a  marker  identified  in  a 
simple  profiling  study  of  primary  and  metastatic  tumors  has 
yielded  a  new  player  in  the  ILK  signaling  pathway.  It  is 
conceivable  that  small  molecule  approaches  to  disrupting 
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LIMD2  binding  to  its  partners  could  influence  the  metastatic 
process. 
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Snail  Recruits  Ringl  B  to  Mediate  Transcriptional  Repression 
and  Cell  Migration  in  Pancreatic  Cancer  Cells 

Jiangzhi  Chen1,2,  Hong  Xu1,  Xiuqun  Zou1,  Jiamin  Wang1,  Yi  Zhu1,  Hao  Chen1,  Baiyong  Shen1, 
Xiaxing  Deng1,  Aiwu  Zhou3,  Y.  Eugene  Chin3,  Frank  J.  Rauscher,  III4,  Chenghong  Peng1,  and 
Zhaoyuan  Hou1,3 


Abstract 

Transcriptional  repressor  Snail  is  a  master  regulator  of  epithelial-mesenchymal  transition  (EMT),  yet  the 
epigenetic  mechanism  governing  Snail  to  induce  EMT  is  not  well  understood.  Here,  we  report  that  in 
pancreatic  ductal  adenocarcinoma  (PDAC),  elevated  levels  of  the  ubiquitin  E3  ligase  RinglB  and  Snail,  along 
with  elevated  monoubiquitination  of  H2A  at  K119  (H2AK119Ubl),  are  highly  correlated  with  poor  survival. 
Mechanistic  investigations  identified  RinglB  as  a  Snail-interacting  protein  and  showed  that  the  carboxyl  zinc 
fingers  of  Snail  recruit  RinglB  and  its  paralog  RinglA  to  repress  its  target  promoters.  Simultaneous  depletion 
of  RinglA  and  RinglB  in  pancreatic  cancer  cells  decreased  Snail  binding  to  the  target  chromatin,  abolished 
H2AK119Ubl  modification,  and  thereby  compromised  Snail-mediated  transcriptional  repression  and  cell 
migration.  We  found  that  RinglB  and  the  SNAG-associated  chromatin  modifier  EZH2  formed  distinct  protein 
complexes  with  Snail  and  that  EZH2  was  required  for  Snail-RinglA/B  recruitment  to  the  target  promoter. 
Collectively,  our  results  unravel  an  epigenetic  mechanism  underlying  transcriptional  repression  by  Snail, 
suggest  Ringl A/B  as  a  candidate  therapeutic  target,  and  identify  H2AK119Ubl  as  a  potential  biomarker  for 
PDAC  diagnosis  and  prognosis.  Cancer  Res;  74(16);  4353-63.  ©2014  AACR. 


Introduction 

Snail  is  a  member  of  SNAG  domain  containing  zinc  finger 
proteins  and  a  master  regulator  of  epithelial-mesenchymal 
transition  (EMT)  and  metastasis  in  various  tumor  types  (1-3). 
Snail  can  directly  bind  to  the  E-boxes  of  E-cadherin  gene 
promoter  to  repress  its  transcription  and  convert  normal 
epithelial  cells  into  mesenchymal  cell  phenotype  (4,  5).  Mech¬ 
anistically,  Snail  recruits  multiple  repressive  protein  complexes 
involving  histone  deacetylation  and  methylation  as  well  as 
DNA  methylation  to  its  target  promoters  and  exerts  its  repres¬ 
sive  function  (6-10).  However,  how  these  protein  complexes 
are  assembled  at  the  target  chromatin  regions  remains  elusive. 
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RinglA  and  RinglB  belong  to  the  RING  domain  containing 
ubiquitin  E3  ligase  family  and  are  crucial  components  of  the 
polycomb  repressive  complex  1  (PRC1)  by  catalyzing  mono¬ 
ubiquitination  of  histone  H2A  at  lysine  119  (H2AK119Ubl; 
ref.  11).  H2AK119Ubl  status  is  associated  with  gene  silencing 
(12),  chromatin  remodeling  (13),  and  X  chromosome  inactiva¬ 
tion  (14).  Genetic  disruption  of  RinglB  in  mice  causes  embry¬ 
onic  lethality  due  to  gastrulation  arrest  and  defective  mesoderm 
formation  (15),  which  are  reminiscent  of  the  Snail  null  mice. 
Snail-deficient  mouse  embryos  also  die  early  in  gestation, 
displaying  defects  in  gastrulation  and  mesoderm  formation 
(16,  17).  The  striking  genetic  evidence  strongly  indicates  that 
Snail  very  likely  correlates  with  RinglB  in  regulation  of  gastru¬ 
lation  and  mesoderm  formation  during  embryo  development. 

A  recent  study  found  an  elevated  RinglB  expression  in  high- 
grade  pancreatic  ductal  adenocarcinoma  (PDAC;  ref.  18);  how¬ 
ever,  no  biologic  and  mechanistic  studies  on  RinglB  were 
performed.  In  this  present  study,  we  report  that  RinglA  and 
RinglB  interact  with  Snail  and  are  important  coregulators  for 
Snail-mediated  transcriptional  repression  and  cell  migration 
in  pancreatic  cancer  cells.  Snail  enhances  the  binding  of 
RinglA  and  RinglB  to  the  promoter  region  of  E-cadherin  to 
increase  H2AK119Ubl  at  this  locus,  and  an  elevated  Snail, 
RinglB,  and  H2AK119Ubl  modification  in  PDAC  is  highly 
correlated  with  poor  prognosis. 

Materials  and  Methods 

IHC  and  tissues  microarray 

IHC  staining  was  performed  as  described  (18)  with  specific 
antibodies  against  Snail,  RinglB,  or  H2AK119Ubl.  Tissue 
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microarray  (TMA)  chips  that  contain  90  cases  of  paired 
tumor  and  peri-tumor  specimens  were  purchased  (HPan- 
Adel80  Sur-01;  ShGnghGi  Outdo  Biotech  Company).  All 
specimens  spotted  on  TMA  chips  were  well  documented 
including  complete  postoperative  fellow-ups  for  periods  from 
3  to  7  years.  Tumor  staging  was  evaluated  according  to  the 
TNM  Classification  of  Malignant  Tumors.  After  IHC  staining, 
all  specimens  were  strictly  evaluated  by  a  senior  pathologist 
and  only  those  with  a  tumor  content  of  more  than  50%  were 
considered  as  tumor  samples  and  the  peri-tissues  showing 
sign  of  chronic  pancreatitis  were  excluded,  and  total  of  60 
cases  are  qualified  for  further  analysis.  Peri-tumor  tissues 
were  defined  as  that  more  than  1  cm  distant  from  the  tumor 
edge.  TMA  chips  were  scanned  by  Aperio  Scanscope  XT,  and 
the  whole  field  of  each  tissue  spot  was  obtained  for  IHC 
evaluation.  The  immunore active  score  system  (IRS;  refs.  19, 
20)  was  used  to  evaluate  the  staining  of  each  sample  as 
follow:  negative,  0  to  1  point;  mild  (+),  2  to  3  points;  moderate 
(++),  4  to  8  points;  strongly  positive  (+++),  9  to  12  points. 
IHC  staining  showing  IRS  scores  of  more  than  4  points  was 
considered  as  high. 

Plasmids 

RinglA  and  RinglB  cDNAs  and  their  mutants  were  cloned 
into  pCMV4-Flag  vector  between  Hind  III  and  Eco  RI  sites.  Snail 
and  its  mutants  were  cloned  into  pCMV5-HA  vector  between 
Hindlll  andXbal  sites.  pLKO.l-shRNAs  targeting  RinglA  were 
ATAGATCTTAGAGATCAGGGC  and  ATCGTTGTGGTCTGA- 
TCTGAC;  targeting  RinglB  were  ATTGTGCTTGTTGAT- 
CCTGGC  and  TTCTAAAGCTAACCTCACAGC,  respectively. 
All  point  mutants  were  made  using  the  QuikChange  Site- 
Directed  Mutagenesis  procedures  (Stratagene),  and  were  con¬ 
firmed  by  DNA  sequencing. 

Cell  culture  and  transfections 

HEK-293T  cells  and  pancreatic  cancer  cells  PanCl  and 
AsPCl  were  obtained  from  the  ATCC  and  were  tested  and 
authenticated  by  DNA  typing  at  the  Shanghai  Jiao  Tong 
University  Analysis  Core.  The  cells  were  maintained  in  DMEM 
supplemented  with  10%  FBS,  2  mmol/L  L-glutamine,  and 
penicillin  (50  U/mL)/streptomycin  (50  pg/mL)  at  37°  C  under 
5%  C02  in  a  humidified  chamber. 

Transfection  of  PanCl  and  HEK-293T  cells  was  performed 
using  Lipofectamine  2000  as  described  (8).  The  viral  super¬ 
natants  were  generated  in  HEK-293T  cells,  and  were  infected 
into  PanCl  and  AsPCl  cells.  Puromycin  was  added  into  the 
media  to  generate  stable  knockdown  of  RinglA  and  RinglB  in 
PanCl  and  AsPCl  cells.  FACS  was  performed  to  sort  the  cells 
stably  expressing  Flag-Snail. 

Affinity  purification  of  Snail-interacting  protein 
complex 

A  Flag-tagged,  full-length  Snail  cDNA  in  the  pcDNA3.1- 
vector  was  stably  expressed  in  HEK-293T  cells.  Single-cell 
clones  were  selected  with  G418  and  screened  by  Western  blot 
assays  using  anti-Flag  antibody.  The  method  used  for  affinity 
purification  was  previously  described  (8).  A  total  of  5  x  109  cells 
were  used  for  affinity  purification,  and  the  eluted  proteins  were 


resolved  on  4%  to  12%  SDS-PAGE  gels  (Invitrogen)  for  Western 
blot  and  colloidal  staining  analyses.  The  proteins  were  excised 
from  the  gel  and  identified  by  standard  mass  spectrometry. 

Coimmunoprecipitation,  Western  blot, 
immunofluorescence,  and  antibodies 

Plasmids  encoding  Flag-RinglA,  Flag-RinglB,  hemaggluti¬ 
nin  (HA)-Snail  proteins  were  transiently  expressed  in  HEK- 
293T  cells,  and  24  hours  after  transfection,  cells  were  lysed  in 
buffer  containing  20  mmol/L  Tris-HCl  (pH  8.0),  150  mmol/L 
NaCl,  2.5  mmol/L  EDTA,  0.5%  NP40, 0.1  mmol/L  phenylmethyl- 
sulfonylfluoride,  and  protease  inhibitor  cocktail.  Method  for 
total  histones  extraction  was  as  described  (12).  The  whole¬ 
cell  extracts  were  precleared  with  protein  A/G  beads,  and 
coimmunoprecipitation  (co-IP)  assays  were  performed  with 
either  Flag  or  HA  antibodies.  The  methods  used  for  Western 
blot  and  immunofluorescence  were  previously  described  (8). 
Antibodies  for  Flag  (Sigma-Aldrich;  F  7425),  HA  (COVANCE; 
MMS-101P),  RinglA,  RinglB,  H2A,  ubiquityl-Histone  H2A- 
lysll9  and  E-cadherin  (Cell  Signaling  Technology;  #2820, 
#5694, #2578, #8240,  #3195),  Snail  (Santa  Cruz;  sc-28199);  and 
(3-actin  (Proteintech;  60008-  1-Ig)  were  purchased. 

Chromatin  immunoprecipitation  and  qPCR 

The  chromatin  immunoprecipitation  (ChIP)  experiments 
were  carried  out  in  PanCl  cells  and  derivatives.  To  prepare 
cells  for  ChIP  assays,  the  PanCl  cells  were  grown  in  10  cm 
plates  to  70%  to  90%  confluency  and  were  processed  as 
described  (8).  The  immunoprecipitated  DNA  fragments  were 
detected  by  qPCR  assays.  The  primer  sets  that  amplify  the  DNA 
fragment  flanking  the  known  E-boxes  in  the  E-cadherin  pro¬ 
moter  are  as  follows:  forward,  5-GCAGGTGAACCCTCAGC- 
CAA-3;  reverse,  5-CACAGGTGCTTTGCAGTTCC-3. 

Total  RNA  was  isolated  from  cells  with  TRIzol  reagent 
(Invitrogen).  qRT-PCR  was  performed  on  a  7500  Fast  Real¬ 
time  PCR  system  (Applied  Biosystem)  using  SYBR  Green  agent. 
Primers  used  for  qRT-PCR  assay  were  listed  in  Supplementary 
information.  All  RT-PCR  assays  were  repeated  three  times. 

Transwell  cell  migration  assays 

PanCl  cells  were  harvested  after  serum-free  starvation  for  12 
hours,  and  were  resuspended  in  plain  DMEM  media.  Ten 
thousand  cells  were  applied  to  8-pm  pore  transwell  filters 
(Corning).  DMEM  media  containing  10%  FBS  were  added  to 
the  bottom  chamber  as  attractants.  After  incubation  for  24 
hours,  the  nonmigrated  cells  at  the  top  of  the  filter  were 
removed  and  the  migrated  cells  at  the  bottom  of  the  filter 
were  fixed  with  4%  paraformaldehyde  and  were  stained  with 
colloidal  staining  method.  The  number  of  migrating  cells  in 
each  chamber  was  quantified  by  counting  nine  randomly 
chosen  fields  under  x20  magnification  using  a  bright-field 
microscope.  Each  condition  was  performed  in  duplicate,  and 
the  average  number  of  cells  per  field  was  represented.  Experi¬ 
ments  were  repeated  three  times. 

Statistical  analysis 

Data  shown  as  mean  ±  SD  were  analyzed  by  the  indepen¬ 
dent  Student  t  test.  The  distribution  of  the  IHC  scoring  results 
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of  each  protein  on  TMA  chips  was  analyzed  by  the  McNemar 
test.  The  correlation  between  the  expression  of  Snail  and 
RingIB  in  PDAC  was  analyzed  by  the  Spearman  rank  corre¬ 
lation  coefficient  test.  Spearman  p  are  categorized  as  moderate 
to  strong  correlations  according  to  Dancey  and  Reidy's  cate¬ 
gorization:  0  (zero);  0.1  to  0.3  (weak);  0.4  to  0.6  (moderate);  0.7 
to  0.9  (strong);  and  1  (perfect;  ref.  21).  The  postoperative 
survival  of  patients  with  PDAC  was  analyzed  by  Kaplan-Meier 
estimator  and  tested  by  the  log-rank.  P  value  of  <0.05  was 
considered  statistically  significant.  Statistical  analysis  was 
performed  using  IBM  SPSS  Statistics  Version  20  software. 

Study  approval 

The  study  was  approved  by  the  Ethic  and  Research  Com¬ 
mittees  of  Ruijing  Hospital,  Shanghai  Jiaotong  University 
School  of  Medicine  and  was  conducted  in  accordance  with 
the  Declaration  of  Helsinki  Principles.  The  procedures  for 
pancreatic  tumor  resection  were  described  in  detail  to  all 
patients  before  admission,  and  informed  consent  of  patients 
was  obtained. 

Results 

Expression  of  Snail  and  RingIB  proteins  in  PDAC  is 
positively  correlated 

To  examine  the  clinical  relevance  of  Snail,  RingIB  and  level 
of  H2AK119Ubl  in  PDAC,  IHC  assays  were  performed  on 
human  pancreatic  ductal  tumor  resection  specimens  and 
paired  peri-tumor  tissues  were  used  as  normal  controls.  The 
staining  for  Snail  was  strong  in  tumor  cells  surrounding  acinus 
and  lumen,  but  was  barely  detected  in  epithelial  cells  of  normal 
tissues  (Fig.  1A,  top).  Similarly,  the  staining  for  RingIB  was 
strong  in  tumor  cells  especially  in  high-grade  differentiated 
cells,  whereas  in  normal  tissues  the  staining  was  weak  or  not 
detectable  except  in  islet  cells  that  showed  positive  staining 
(Fig.  1A,  middle).  Notably,  the  staining  for  H2AK119Ubl  was 
strong  in  tumor  cells,  but  remained  low  level  in  islet  cells  and 
acinar  cells  of  the  normal  tissues  (Fig.  1A,  bottom). 

To  further  elucidate  the  clinical  correlation  of  Snail,  RingIB, 
and  level  of  H2AK119Ubl  in  PDAC,  we  performed  IHC  on  TMA 
chips  containing  strictly  selected  60  cases  of  paired  tumor  and 
peri-tumor  specimens  of  PDAC.  Of  the  60  patients,  38  cases  are 
males  and  22  cases  are  females  with  an  average  age  of  62  ±  11 
years  old.  These  patients  have  undergone  pancreatectomy 
between  September  2004  and  December  2008,  and  the  com¬ 
plete  postoperative  follow-up  was  finished  on  December  2011, 
with  duration  of  3  to  7  years.  The  overall  median  survival  time 
of  60  patients  after  pancreatectomy  was  15  months.  The 
McNemar  test  was  conducted  to  analyze  the  symmetrical 
distribution  of  the  IHC  staining  for  Snail,  RingIB,  and 
H2AK119Ubl  among  the  60  cases  of  samples  according  to  the 
IRS  scores  (high:  >4  vs.  low:  <4).  We  found  that  specimens 
showing  high  levels  of  Snail,  RingIB,  and  H2AK119Ubl  were 
distributed  asymmetrically  with  much  more  cases  found  in 
tumor  samples  (Fig.  IB,  P  <  0.05).  We  next  performed  the 
Spearman  rank  correlation  coefficient  analysis  to  determine 
whether  elevated  expression  of  Snail  and  RingIB  in  PDAC 
samples  cooccurs.  Indeed,  there  was  a  positive  correlation 


between  Snail  and  RingIB  (Fig.  1C,  P  <  0.05).  Taken  together, 
these  data  suggest  Snail  and  RingIB  may  functionally  correlate 
in  PDAC  development  and  progression. 

High  level  of  H2AK119Ubl  predicts  poor  survival  of 
patients  with  PDAC 

To  determine  whether  there  is  a  correlation  between  high 
levels  of  Snail,  RingIB,  H2AK119Ubl  and  survival  rate  of 
postoperative  patients  with  PDAC,  Kaplan-Meier  analysis  was 
conducted.  We  found  that  patients  with  PDAC  expressing  high 
level  of  Snail  and  high  level  of  H2AK119Ubl  showed  tendency 
of  higher  mortality  and  was  positively  correlated  with  poor 
survival  (Fig.  ID,  median  survival  time:  9  months;  Fig.  IF, 
median  survival  time:  11  months);  expression  level  of  RingIB 
displayed  similar  tendency,  but  was  not  statistically  significant 
(Fig.  IE).  Remarkably,  patients  with  PDAC  showing  both  high 
level  of  Snail  and  RingIB  strongly  predicted  poor  survival 
(Fig.  1G,  median  survival  time:  9  months).  Taken  together, 
these  data  indicate  that  high  level  of  H2AK119Ubl  in  PDAC  is 
a  potential  epigenetic  biomarker  for  prediction  of  PDAC 
prognosis. 

RinglA  and  RingIB  are  Snail-interacting  proteins 

To  examine  if  there  is  a  physical  interaction  between  Snail 
and  RingIB,  we  stably  expressed  N-terminal  epitope-tagged 
Snail  protein  (Flag-Snail)  in  HEK-293T  cells,  and  performed 
affinity  purification  of  the  Snail-interacting  proteins  with  Flag 
antibody.  The  coeluted  proteins  along  with  Snail  were  identi¬ 
fied  by  mass  spectrometry  analysis.  Notably,  RingIB  was  found 
in  the  protein  band  migrating  at  43  kDa  (Fig.  2A).  Next,  we 
coexpressed  HA-Snail  and  Flag-RinglB  in  HEK-293T  cells  and 
performed  co-IP  assays  to  verify  their  interaction.  Indeed,  Snail 
and  RingIB  interacted  with  each  other  (Fig.  2B).  RinglA  also 
interacted  with  Snail  (Fig.  2C).  To  further  confirm  the  inter¬ 
action  of  Snail  with  RingIB,  we  attempted  to  detect  the 
endogenous  interaction.  The  endogenous  RingIB  and  Snail 
were  readily  detected  by  Western  blot  assays  using  specific 
antibodies  in  PanCl  cells.  Co-IP  assays  indicated  RingIB 
interacted  with  Snail  at  endogenous  level  (Fig.  2D).  Collectively, 
these  data  clearly  demonstrate  that  RinglA  and  RingIB  are 
novel  Snail-interacting  proteins. 

RinglA  and  RingIB  bind  to  the  carboxyl  tandem  zinc 
fingers  of  Snail 

To  identify  the  regions  in  the  Snail  protein  that  interact 
with  RinglA  and  RingIB,  plasmids  encoding  the  N-terminal 
SNAG  domain  (Snail-NT)  or  C-terminal  zinc  fingers  (Snail- 
CT)  were  constructed  respectively  and  were  expressed  in 
HEK-293T  cells  alone  with  RinglA  or  RingIB  (Fig.  3A).  Co-IP 
assays  demonstrated  that  the  zinc  finger  region  retained  the 
binding  activity  with  RinglA  and  RingIB  (Fig.  3B),  whereas 
the  SNAG  domain  and  the  linker  region  of  Snail  showed  no 
binding  activity. 

The  RING  domains  of  RinglA  and  RingIB  are  required 
for  the  interaction  with  Snail 

To  identify  the  regions  of  RinglA  and  RingIB  responsible 
for  the  interaction  with  Snail,  we  generated  a  series  of 
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Figure  1.  High  expression  of  Snail 
and  Ringl  B  is  positively  correlated 
and  predicts  poor  prognosis  in 
PDAC.  A,  IHC  staining  to  detect 
Snail,  Ringl  B,  and  H2AK1 1 9Ub1  in 
paired  human  PDAC  specimens 
(tumor  vs.  peri-tumor).  Arrows, 
positively  stained  cells,  a  and  d, 
duct  epithelial  cells;  b,  islet  cells; 
c,  acinar  cells;  e,  tumor  cells. 
Original  magnifications,  x40. 

B,  the  McNemar  test  of  the  IHC 
staining  of  60  cases  of  the  paired 
PDAC  specimens  on  TMA  chips. 
Each  group  was  shown  by  the 
distribution  of  IHC  staining  scores 
for  each  case.  Only  IHC  scores  >  4 
point  (++)  was  considered  high, 
n  =  60;  *,  statistical  significance, 

P  <  0.05.  C,  Spearman  p  analysis  of 
correlation  of  Snail  and  Ringl  B 
expression  in  PDAC.  n  =  60. 
p  >  0,  positive  correlation;  p  <  0, 
negative  correlation.  D-G,  survival 
analysis  of  patients  with  PDAC 
patients  by  Kaplan-Meier  plots  and 
log-rank  tests.  Patients  were 
categorized  by  high  and  low 
expression  of  the  Ringl  B,  Snail, 
and  H2AK1 19Ub1  based  on  IHC 
staining  scores.  H,  high;  L,  low. 
Average  follow-up  time  of  60 
patients  was  26.2  months  (range, 
0-87  months).  The  median  survival 
time  of  each  group  is  as  follows: 
Snail  (L),  33  months;  Snail  (H),  9 
months;  Ringl  B  (L),  31  months; 
Ringl  B  (H),  11  months; 

H2AK1 19Ub1  (L),  31  months; 
H2AK1 1 9Ub1  (H),  1 1  months;  Snail 
(L)  +  Ringl  B  (L),  the  median 
survival  time  cannot  be  estimated 
because  more  than  50%  of  cases 
are  censored  in  this  group;  Snail  (H) 
+  Ringl  B  (H),  9  months. 


deletion  mutants  of  RinglA  (Fig.  3C,  left).  Next,  we  coex¬ 
pressed  the  deletion  mutants  for  RinglA  and  Snail  in  HEK- 
293T  cells  and  performed  co-IP  assays  with  Flag  antibody.  We 
observed  that  the  N-terminal  RinglA  containing  the  con¬ 
served  RING  domain  retained  the  binding  ability  to  Snail  (Fig. 
3C,  right),  but  the  C-terminal  region  of  RinglA  was  unable  to 
bind  Snail. 

To  further  determine  if  the  RING  domain  contributes  to  the 
interaction  with  Snail,  simultaneous  mutations  of  the  66th 
histidine  (H66A)  and  69th  cysteine  (C69A)  to  alanines  in 
RinglA  were  made  to  disrupt  the  RING  structure  (Fig.  3D). 


Co-IP  assays  showed  that  the  double  mutation  of  the  key 
residues  H66  and  C69  of  RinglA  resulted  in  loss  of  the  binding 
activity  to  Snail  (Fig.  3D),  suggesting  that  the  RING  domain  is 
critical  for  Snail  binding.  Similarly,  mutation  of  H69  and  C72  to 
alanines  in  the  RING  domain  of  RinglB  abolished  its  binding  to 
Snail  (Fig.  3E).  Further,  we  showed  that  mutations  of  these  core 
amino  acid  residues  in  the  RING  domains  did  not  change  the 
subcellular  localization  of  RinglA  and  RinglB  proteins  (Sup¬ 
plementary  Fig.  SI).  Taken  together,  these  results  demonstrate 
that  the  RING  domains  of  RinglA  and  RinglB  are  required  for 
their  interaction  with  Snail. 
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Figure  2.  Ringl  A  and  RingIB  are  Snail-interacting  proteins.  A,  mass  spectrometry  analysis  identifies  RingIB  as  a  Snail-interacting  protein.  Affinity 
purification  of  lysate  from  HEK-293T  cell  stably  expressing  Flag-Snail  was  performed  with  Flag  antibody.  SDS-PAGE  and  colloidal  staining  show  proteins 
coeluted  with  Snail.  Mass  spectrometry  analysis  discovered  the  band  migrating  at  43  kDa  contains  RingIB  and  the  major  band  migrating  at  34  kDa  is 
Flag-Snail  protein.  B,  RingIB  interacts  with  Snail.  Lysates  from  HEK-293T  cells  transfected  with  plasmids  encoding  HA-Snail  and/or  Flag-RingIB  are 
incubated  with  Flag  M2  beads,  and  the  coeluted  proteins  were  blotted  with  HA  antibody.  C,  Ringl  A  interacts  with  Snail.  D,  Snail  interacts  with  Ringl  B  at 
endogenous  level.  Lysates  from  PanCI  cells  were  incubated  with  anti-RingIB,  and  the  coeluted  proteins  were  blotted  with  Snail  antibody.  Ten  percent 
of  the  input  was  loaded,  a,  anti;  WB,  Western  blot. 


Ringl  A  and  RingIB  are  required  for  Snail-mediated 
E-cadherin  gene  repression  and  cell  migration 

To  examine  the  role  of  Ringl  A  and  RingIB  in  Snail- 
mediated  transcriptional  repression,  we  first  depleted  their 
expression  in  PanCI  cells,  a  widely  used  pancreatic  cancer 
cells,  using  specific  shRNAs  targeting  RinglA  and  RingIB 
simultaneously  to  overcome  the  redundancy  effect  (Fig.  4A). 
Western  blot  assays  indicated  that  depletion  of  RinglA  and 
RingIB  resulted  in  global  decrease  of  H2AK119Ubl,  but  no 
apparent  change  on  H3K27Me3  (Fig.  4A).  qRT-PCR  assays 
showed  that  expression  of  the  Snail  target  genes  E-cadherin 
and  Cyclin  D2  and  the  known  RinglA/B  target  genes  HOXC5 
and  HOXB4  was  increased  in  PanCl-shRinglA/B  cells  (Fig. 
4B  and  Supplementary  Fig.  S2).  Next,  we  stably  expressed 
Flag-Snail  in  PanCI -shLuc  or  PanCI -shRinglA/B  cells  using 
lentiviral  system  (Fig.  4C).  The  expression  of  Snail,  RinglA, 
and  RingIB  was  validated  by  Western  blot  assays.  The 
protein  and  mRNA  levels  of  E-cadherin  were  examined  by 
Western  blot  and  qRT-PCR  approaches.  As  expected,  expres¬ 
sion  of  Snail  markedly  reduced  both  the  protein  and  mRNA 
levels  of  E-cadherin  in  PanCI -shLuc  cells,  but  still  slightly 
decreased  the  E-cadherin  expression  in  PanCI -shRinglA/B 
cells  (Fig.  4C  and  D).  In  addition,  we  observed  similar  effect 
of  depletion  of  RinglA  and  RingIB  on  Snail-mediated  tran¬ 
scriptional  repression  in  another  pancreatic  cancer  AsPCl 
cells  (Supplementary  Fig.  S3). 

To  examine  the  role  of  RinglA  and  RingIB  in  Snail-induced 
cell  migration,  we  performed  standard  transwell  assays  to 


measure  the  migration  capability  of  the  PanCI  cells  and  its 
derivatives.  Expression  of  Snail  robustly  enhanced  migration 
capability  of  the  PanCl-shLuc  cells.  However,  expression  of 
Snail  in  PanCI -shRinglA/B  cells  increased  cell  migration,  but 
at  lesser  extent  compared  with  that  of  in  PanCl-shLuc-Snail 
cells  (Fig.  4E  and  F).  Collectively,  these  data  suggest  that 
RinglA  and  RingIB  are  essential  for  Snail  to  maximally  repress 
E-cadherin  and  to  induce  cell  migration. 

Snail  recruits  RinglA  and  RingIB  to  the  promoter  locus 
of  E-cadherin 

To  examine  if  Snail  recruits  RinglA  and  RingIB  to  the  E- 
cadherin  promoter,  we  performed  ChIP  assays  in  PanCI  - 
derivated  cells  with  antibodies  specific  to  Snail,  RinglA, 
RingIB,  or  H2AK119Ubl.  The  coeluted  DNA  fragments  were 
amplified  using  primer  pairs  flanking  the  E-boxes  located  in 
the  proximal  promoter  of  E-cadherin  (Fig.  5A).  In  PanCl- 
shLuc  cells,  expression  of  Snail  resulted  in  an  increased 
binding  of  Snail,  RinglA,  and  RingIB  to  the  E-cadherin 
promoter,  and  a  concomitant  increase  in  H2AK119Ubl 
modification  at  this  locus  (Fig.  5B-E).  Depletion  of  RinglA 
and  RingIB  simultaneously  resulted  in  loss  of  their  binding 
to  the  E-cadherin  promoter,  diminished  H2AK119Ubl  mod¬ 
ification,  and  decreased  binding  of  Snail  to  the  E-cadherin 
promoter,  suggesting  RinglA  and  RingIB  may  stabilize 
Snail/DNA  complexes  at  the  target  promoter.  However, 
depletion  of  RinglA  and  RingIB  did  not  affect  EZH2  binding 
and  H3K27  methylation  at  the  E-cadherin  promoter  locus 


www.aacrjournals.org 


Cancer  Res;  74(16)  August  15,  2014 


4357 


44 


Chen  et  al. 


156 

: 


151 

E 


**  Deletion  mutants  of  Snail 

Snail-WT(1-264)  |snag| 

1  21 

Snail-NT(1-151)  I 

1 

Snail-CT(151-264) 

B 

Plasmid: 

RingIB-Flag: 

Snail-WT-HA: 

Snail-NT-HA: 

Snail-CT-HA: 

kDa 

IP:a-Flag  34- 
WB:oc-HA 
Input: 

WB:oc-Flag  43- 
WB:oc-HA  34-1 

Zb- 

17- 
10- 


|  Zn  Finger  | 


-  +  +  +  + 

+  -  + 

+ 

+ 


-  Snail-WT 
-"  Snail-CT 
-•  RingIB 

-  Snail-WT 
-Snail-NT 
-"  Snail-CT 


1  2  3  4  5 


Deletion  mutants  of  RinglA 
Ring  1A-WT(1 -406)  f  |ring|  |nls| 


1  47 

91  201  204 

406 

Ringl  A-NT(1  -234)  f 

1 

1 

234 

Ringl  A-51  (96-406) 

1 

“1 

96 

406 

Ringl  A-52(1 69-406) 

1 

“1 

Ring  1  A-CT  (234-406) 

169 

r 

406 

1 

234 

406 

RING  finger  (C3HC4)  in  RinglA: 
WT:CPICLDMLKNTMTTKECL  RF  SDC 
66  69 

Ringl  A-H66A,C69A 


kDa  ■ 
34- 


34-  * 


—  -"  Snail 

—  —  -"Snail 


12  3  4 


Plasmid: 

RinglA-Flag: 

Snail-WT-HA: 

Snail-NT-HA: 

Snail-CT-HA: 

kDa 

IP:a-Flag  34- 
WB:oc-HA 

.  *  10" 

Input: 

WB:a-Flag  55- 
WB:oc-HA  It 

W. 

10- 


+  + 
-  + 


1  2  3  4  5 


-  Snail-WT 
-•  Snail-CT 
-"  RinglA 
- Snail-WT 

-Snail-NT 
-«  Snail-CT 


Plasmid: 

Snail-HA: 

Ringl  A-WT-Flag: 
Ringl  A-NT-Flag: 
Ringl  A-51 -Flag: 
Ringl  A-82-Flag: 
Ringl  A-CT-Flag: 
IP:a-Flag  kDa 
WB:oc-HA 
Input: 

WB:oc-HA 


WB:a-Flag 


+  -  +  + 
+  + 

+ 


34- 


-"  Snail 


-•Ringl  A- WT 
Ringl  A-51 
-"  Ringl  A-52 
◄  Ringl  A-NT 
^  Ringl  A-CT 


MT: 

Plasmid: 

Snail-HA:  +  -  +  + 

Ringl  A-WT-Flag:  +  + 

Ringl  A-MT-Flag:  + 

IP:a-Flag 
WB:a-HA 
Input: 

WB:oc-HA 

WB:oc-Flag  55_[  — - -  Ringl  A-WT&MT  WB:a-Flag 


RING  finger  (C3HC4)  in  RingIB: 
WT:CPICLDMLKNTMTTKECL  RF  ADC 
69  72 

MT:  Ringl  B-H69A.C72A 

Plasmid: 

Snail-HA:  +  -  +  + 

Ringl  B-WT-Flag:  +  + 

RingIB-MT-Flag:  + 

IP:oc-Flag  kDa  I  ^  l-Cnaii 

WB:a-HA  34-^_^_J-Snail 

wl-a-HA  34-|®  —  —  |— Snail 


43- 


n -"Ringl  B-WT&MT 


12  3  4 


Figure  3.  A,  the  RING  domains  of 
RinglA  and  Ringl  B  and  the  zinc 
fingers  of  Snail  are  required  for  their 
interaction.  Schematic 
representation  of  deletion  mutants 
of  Snail.  B,  the  C-terminal  zinc 
fingers  of  Snail  interact  with  RinglA 
and  Ringl  B.  Lysates  prepared 
from  HEK-293T  cells  transfected 
with  plasmids  encoding  HA-Snail 
and  its  truncation  mutants, 
together  with  Flag-Ringl  A  or  Flag- 
RinglB  were  incubated  with  anti- 
Flag  M2  beads,  and  coeluted 
proteins  were  analyzed  by  anti-HA. 
C,  N-terminal  (1-96  aa)  of  RinglA 
interacts  with  Snail  schematic 
representation  of  deletion  mutants 
of  RinglA.  N-terminus  of  RinglA 
contains  a  conserved  RING  finger 
domain.  D,  mutation  of  the  66th 
histidine  and  69th  cysteine  in  the 
RING  finger  of  RinglA  abolishes 
the  binding  between  Snail  and 
RinglA.  E,  mutation  of  the  69th 
histidine  and  No.  72  cystenie  to 
alanines  in  the  RING  finger  of 
RingIB  abolishes  the  interaction 
of  Snail  and  RingIB.  WT,  wild- 
type;  MT,  mutant-type;  NT,  Amino 
terminus;  CT,  carboxyl  terminus. 


(Fig.  5F  and  G).  Taken  together,  these  data  demonstrate  that 
Snail  recruits  RinglA  and  RingIB  to  modify  H2A  at  K119  at 
the  target  chromatin  region. 

Depletion  of  EZH2  reduces  Snail/RinglA/B  multiprotein 
complex  binding  at  the  E-cadherin  promoter  locus 

Previous  studies  demonstrated  that  EZH2  binds  to  the 
SNAG  domain  of  Snail  to  repress  E-cadherin  expression  by 
trimethylation  of  H3  at  K27  (22, 23).  To  examine  if  EZH2  affects 
Ringl A/B-mediated  H2AK119Ubl  modification,  we  generated 
PanCl -shEZH2  and  PanCl -shLuc  cells.  Depletion  of  EZH2 
resulted  in  markedly  global  decrease  of  H3K27Me3  and  slight 
decrease  of  H2AK119Ubl  (Fig.  6A).  Similar  to  that  observed  in 
PanCl -shRinglA/B  cells,  depletion  of  EZH2  increased  E-cad¬ 


herin  expression  as  evidenced  by  Western  blot  and  indirect 
immunofluorescence  assays  (Fig.  6B  and  C).  Consistently, 
expression  of  Snail  in  PanCl-shLuc  cells  effectively  repressed 
E-cadherin,  but  only  showed  weak  repression  on  E-cadherin 
expression  in  PanCl-shEZH2  cells  (Fig.  6B  and  C),  indicating 
EZH2  is  important  for  Snail-mediated  E-cadherin  repression. 
Next,  ChIP  assay  was  performed  in  these  resulting  cells. 
Depletion  of  EZH2  in  PanCl  cells  decreased  the  binding  of 
Snail,  RinglA,  and  RingIB  to  the  E-cadherin  promoter,  dimin¬ 
ished  H3K27Me3,  and  greatly  reduced  H2AK119Ubl  modifica¬ 
tions  induced  by  Snail  at  this  locus  (Fig.  6D-I).  These  observa¬ 
tions  indicate  that  EZH2  functions  upstream  of  RinglA  and 
RingIB  and  promotes  Snail  to  recruit  RinglA  and  RingIB  more 
efficiently  at  the  E-cadherin  promoter  locus. 
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Figure 4.  RinglA  and  RingIB  are  required  for  Snail-mediated  E-cadherin  gene  repression  and  cell  migration.  A,  simultaneous  depletion  of  RinglA  and 
RingIB  in  PanCI  cells  results  in  global  decrease  of  H2AK119Ub1 .  Double  depletion  of  RinglA  and  Ringl  B  in  PanCI  cells  was  made  using  two  sets  of 
specific  shRNAs  targeting  RinglA  and  RingIB,  respectively,  and  created  two  pools  of  PanCI -shRinglA/B  cells.  Western  blots  were  performed  to 
examine  the  expression  of  RinglA,  RingIB,  H2AK119Ub1,  and  H3K27Me3.  B,  E-cadherin  is  upregulated  in  PanCI -shRinglA/B  cells.  Total  RNAs 
were  isolated  from  the  resulting  PanCI  cells,  and  expression  of  Snail  and  E-cadherin  was  examined  by  qRT-PCR.  #1  and  #2,  different  cell  pools 
generated  from  RinglA  and  RingIB  double  knocking  down  by  shRNAs;  GAPDH  was  used  as  loading  reference.  Data,  mean  ±  SD  from  three 
independent  experiments.  *,  statistical  significance,  P  <  0.05.  NS,  statistical  insignificance,  P  >  0.05.  C,  Snail  was  stably  expressed  in  PanCI  -shLuc  or 
PanCI -shRinglA/B  cells.  Plu-Snail-Flag  was  introduced  into  the  resulting  PanCI  cells  vial  lentiviral  infection,  and  the  cells  were  sorted  by  FACS. 
The  expression  of  Snail-Flag,  RinglA,  Ringl  B,  and  E-cadherin  was  examined  by  Western  blot  assays.  D,  depletion  of  Ringl  A  and  Ringl  B  compromises 
Snail's  repression  on  E-cadherin.  qRT-PCR  assays  were  conducted  to  examine  the  mRNA  of  E-cadherin.  GAPDH  was  used  as  loading  reference. 

E,  Snail  induces  cell  migration  in  transwell  assays.  F,  statistical  analysis  for  the  migrating  cells  from  transwell  assays  was  shown  in  the  bar  graphs.  Nine 
randomly  chosen  fields  were  counted  for  statistic  analysis.  Data,  mean  ±  SD  from  three  independent  experiments.  #1  and  #2,  different  cell 
pools  generated  from  RinglA  and  RingIB  double  knocking  down  by  shRNAs;  *,  P  <  0.05. 


EZH2  and  RingIB  form  distinct  complexes  with  Snail 

To  determine  if  EZH2,  Snail,  and  RingIB  form  one  func¬ 
tional  complex,  we  coexpressed  HA-Snail,  Myc-EZH2,  and 
Flag-RinglB  in  HEK-293T  cells  and  performed  co-IP  assays 
with  Flag  antibody.  When  these  three  proteins  were  coex¬ 
pressed,  RingIB  coimmunoprecipitated  Snail,  but  not  EZH2, 


indicating  Snail,  EZH2,  and  RingIB  do  not  coexist  in  the  same 
complex  (Fig.  7A).  To  further  strengthen  this  observation,  we 
performed  size  exclusion  fractionation  of  whole-cell  extracts 
prepared  from  HEK-293T  cells  expressing  Snail,  RingIB,  and 
EZH2.  EZH2  was  eluted  in  a  single  peak  (fractions  12-14), 
RingIB  was  detected  in  fraction  15  and  was  peaked  at 
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Figure  5.  A,  Snail  recruits  RinglA 
and  Ringl  B  to  modify  H2A  at  K1 1 9 
at  the  target  chromatin  region. 
Schema  shows  PCR  primers  used 
for  amplifying  the  E-cadherin 
promoter  flanking  the  functional 
E-boxes.  B-G,  ChIP  DNA 
fragments  from  PanCI  cells  were 
analyzed  by  qPCR  assays.  For 
comparison  between  cells,  PCR 
amplifications  were  normalized  to 
their  inputs  respectively.  The  units 
on  the  y  axis  were  arbitrary;  data, 
mean  ±  SD  from  three  independent 
experiments.  *,  P  <  0.05.  NS, 
statistical  insignificance,  P  >  0.05. 
Enrichment  of  Snail  (B), 

H2AK1 19Ub1  (C),  RinglA  (D),  and 
Ringl  B  (E)  at  the  E-cadherin 
promoter  is  increased  when  Snail  is 
expressed,  whereas  depletion  of 
RinglA  and  Ringl  B  results  in  loss 
of  Snail  occupancy  and  diminished 
H2AK119Ub1  modification  at  the 
E-cadherin  promoter.  Snail 
increases  EZH2  binding  and 
H3K27Me3  at  E-cadheirn  promoter 
locus,  but  depletion  of  Ringl  A  and 
Ringl  B  does  not  affect  EZH2 
binding  (F)  and  FI3K27Me3 
modification  (G). 


fractions  16  to  17;  Snail  was  eluted  in  one  peak  (fractions  12- 
16;  Fig.  7B).  These  observations  indicate  that  Snail  forms  two 
complexes,  one  with  EZH2  and  another  one  with  Ringl B. 
Together,  these  data  suggest  that  EZH2,  Snail,  and  Ringl B  do 
not  coexist  in  the  same  protein  complex;  rather,  EZH2  and 
RinglB  form  two  distinct  protein  complexes  with  Snail 
respectively  (Fig.  7C). 

Discussion 

PDAC  is  one  of  the  most  malignant  human  cancers,  with  a 
5-year  survival  rate  of  only  5%  and  a  median  survival  of  less 
than  6  months  (24,  25).  Almost  all  patients  with  PDAC 
develop  metastases  due  to  its  properties  of  rapid  progres¬ 
sion,  potent  invasiveness,  and  profound  resistance  to  treat¬ 
ments.  Identifying  biomarkers  for  early  diagnosis  and  prog¬ 
nosis  is  especially  important  to  reduce  the  death  rate  of 
patients  with  PDAC. 

H2AK119Ubl  modification  is  important  during  the  embry¬ 
onic  development  and  organogenesis  (12-15);  however,  the 
biologic  significance  of  H2AK119Ubl  in  tumor  progression 
has  not  yet  been  elaborated.  In  this  study,  we  found  that 


H2AK119Ubl  is  markedly  elevated  in  tumor  cells  of  PDAC, 
and  the  level  of  H2AK119Ubl  is  positively  associated  with  poor 
survival,  indicating  that  H2AK119Ubl  is  a  potential  epigenetic 
biomarker  for  diagnosis  and  prognosis  of  PDAC.  However,  we 
observed  that  levels  of  Snail,  RinglB,  and  H2AK119Ubl  are  not 
correlated  in  some  cases  of  PDAC  specimens;  for  example,  of 
the  60  cases,  36  cases  show  Snail  (H),  37  cases  show  RinglB  (H), 
but  only  28  cases  show  both  Snail  (H)  and  RinglB  (H;  Fig.  1). 
The  discrepancy  between  the  level  of  Snail,  RinglB,  and 
H2AK119Ubl  modification  suggests  additional  mechanisms 
that  are  independent  of  the  Snail/RinglB  complex  and  might 
be  involved  in  Snail  and  RinglB  functions  as  well  as 
H2AK119Ubl  modification  in  PDAC.  To  clarify  this  puzzle, 
ChIP-seq  analyses  of  the  occupancy  of  Snail  and  RinglB,  and 
the  loci  with  high  level  of  H2AK119Ubl  in  the  genome  of  PDAC 
specimens,  will  help  to  obtain  global  understanding  of  Snail, 
RinglB,  and  H2AK119Ub  1-dependent  or  -independent  repres¬ 
sive  pathways. 

A  number  of  the  SNAG-associated  histone  modification 
complexes  such  as  Sin3A-HDAC  1  /HDAC2,  EZH2/SUZ12, 
LSD  1 -CoREST,  and  Ajuba-PRMT5  have  been  identified  (6- 
10),  but  how  these  complexes  are  orchestrated  and 
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Figure  6.  Depletion  of  EZH2  reduces  Snail/Ringl  A/B  multiprotein  complex  binding  at  the  E-cadherin  promoter  locus.  A,  depletion  of  EZH2  in  PanCI  cells  by 
specific  shRNAs  results  in  global  decrease  of  H3K27Me3  and  elevated  E-cadherin.  Western  blot  showed  EZH2  is  effectively  depleted  in  PanCI  cells,  global 
H3K27Me3  is  reduced,  and  H2AK119Ub1  is  partially  decreased  in  these  cells.  #1  and  #2,  different  cell  pools  generated  from  EZH2  knocking  down  by 
two  specific  shRNAs.  B,  Snail-Flag  is  stably  expressed  in  PanCI -shLuc  or  PanCI -shEZH2  (pool  #2  in  A)  cells.  Expression  of  Snail  effectively  represses 
E-cadherin  in  PanCI  -shLuc  cells,  but  only  weakly  represses  E-cadherin  in  PanCI  -shEZH2  cells.  Whole-cell  lysate  (1 00  |ug)  was  loaded  and  p-actin  was  used 
as  loading  control.  C,  immunofluorescence  staining  (green)  of  E-cadherin.  E-cadherin  is  increased  in  PanCI -shEZH2  cells.  Expression  of  Snail  markedly 
decreases  E-cadherin  in  PanCI -shLuc  cells,  but  only  slightly  reduces  E-cadherin  in  PanCI -shEZH2  cells.  Arrows,  E-cadherin  localization  on  cell 
boundary.  Original  magnification,  x63.  D-l,  ChIP  DNA  fragments  from  PanCI -shLuc  or  PanCI -shEZH2  cells  overexpressing  Snail  were  analyzed  by  qPCR 
assays.  PCR  amplifications  were  normalized  to  their  inputs  respectively.  Enrichment  of  Snail  enhances  the  binding  of  EZH2  (E),  Ringl  A  (G),  and  Ringl  B  (H) 
to  E-cadherin  promoter,  increases  H3K27Me3  (F)  and  H2AK119Ub1  (I)  levels,  whereas  depletion  of  EZH2  decreases  binding  of  Snail,  Ringl  A,  and 
RingIB  to  E-cadherin  promoter,  abolishes  H3K27Me3,  and  decreases  H2K119Ub1.  The  units  on  they  axis  are  arbitrary.  Data,  mean  ±  SD  from  three 
independent  experiments.  *,  P  <  0.05.  NS,  without  statistical  significance,  P  >  0.05. 


assembled  at  the  target  chromatin  is  largely  unknown.  Here, 
we  provided  an  example  that  how  the  SNAG-associated 
EZH2  and  the  zinc  finger-associated  RinglA/B  complexes 
cooperate  at  the  target  chromatin  to  regulate  gene  expres¬ 
sion.  EZH2  can  trimethylate  H3  at  K27,  which  is  required  for 
Snail-dependent  E-cadherin  repression  during  cancer  pro¬ 
gression  (22,  23).  Consistently,  we  showed  that  depletion  of 
EZH2  in  PanCI  cells  results  in  loss  of  Snail-mediated  repres¬ 


sion  on  E-cadherin,  and  a  concomitant  decrease  in  the 
binding  of  Snail,  RinglA,  and  RingIB  to  the  E-cadherin 
promoter,  indicating  that  EZH2  functions  upstream  of 
RinglA/B  and  promotes  Snail  to  recruit  RinglA  and  RingIB 
more  efficiently  at  the  E-cadherin  promoter  locus.  This 
observation  is  well  supported  by  previous  findings  that 
H3K27me3  is  thought  to  involve  in  the  initiation  of  gene 
repression  and  serve  as  a  docking  site  for  the  recruitment  of 
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Figure  7.  EZH2  and  Ringl  B  form  two  distinct  complexes  with  Snail.  A,  Snail,  Ringl  B,  and  EZH2  do  not  coexist  in  the  same  complex.  Lysates  from  HEK-293T 
cells  transfected  with  plasmids  of  HA-Snail,  Flag-Ringl  B,  and  Myc-EZH2  were  incubated  with  anti-Flag  M2  beads,  and  coeluted  proteins  were  analyzed  by 
HA  and  Myc  antibodies.  Co-IP  showed  that  Ringl  B  does  not  interact  with  EZH2.  B,  size  exclusion  fractionation  of  whole-cell  extracts  prepared  from 
HEK-293T  cells  expressing  HA-Snail,  Flag-Ringl  B,  and  Myc-EZH2  was  performed  on  Superose  6 1 0/300  GL  column,  and  fractions  of  proteins  were  collected 
at  1  mL  per  fraction  at  0.35  mL  per  minute  in  the  cold  room.  Western  bolt  analysis  of  the  proteins  from  each  fraction  with  HA,  Myc,  or  Flag  antibodies 
showed  that  Ringl  B  and  EZH2  form  two  distinct  complexes  with  Snail.  C,  model  for  Snail-mediated  repression.  Snail  recruits  Ringl  A  and  Ringl  B  via  the 
carboxyl  tandem  zinc  fingers  to  monoubiquitylate  H2AK119,  whereas  Snail  recruits  EZH2  via  SNAG  domain  to  trimethylate  H3K27,  which  may  enhance 
Snail/Ringl  A/B  complex  recruitment.  Snail/Ringl  A/B  and  Snail/EZH2  complexes  may  bind  to  different  E-boxes  or  dynamically  bind  to  the  same  E-boxes 
at  the  target  promoters. 


PRC1  proteins  (26-29),  whereby  PRC1  is  recruited  to  main¬ 
tain  the  stable  repression  of  genes.  However,  one  remaining 
question  is  that  how  exactly  Snail/EZH2  and  Snail/RinglA/B 
complexes  are  recruited  to  the  target  promoters  because 
these  two  complexes  do  not  coexist.  Notably,  we  also 
observed  that  Snail  occupancy  at  the  target  chromatin  is 
affected  by  RinglA  and  RinglB,  suggesting  RinglA  and 
RinglB  may  stabilize  the  Snail  complex  at  the  target  chro¬ 
matin.  Although  we  showed  that  Snail  binds  to  the  RING 
domains  of  RinglA  and  RinglB,  Snail  is  not  a  substrate  of 
RinglA  and  RinglB  (unpublished  data).  The  detailed  mech¬ 
anism  by  which  RinglA  and  RinglB  regulate  chromatin 
binding  of  Snail  is  not  clear  and  needs  to  explore  further. 
Moreover,  we  observed  that  depletion  of  RinglA  and  RinglB 
in  PanCl  cells  abolishes  RinglA  and  RinglB  binding  and 
H2AK119Ubl  modification  at  the  E-cadherin  promoter,  but 
expression  of  Snail  in  PanCl  cells  still  represses  E-cadherin 
and  induces  cell  migration  in  these  cells  (Fig.  4),  indicating 
Snail  may  recruit  additional  repression  complexes  to  repress 
transcription  even  in  the  absence  of  RinglA  and  RinglB.  We 
postulate  that  the  binding  of  Snail-associated  complexes  to 


Snail  target  chromatin  may  be  dynamically  and  temporally 
regulated  during  the  development  and  EMT  events,  which 
need  to  be  dissected  further  in  a  system  so  that  expression  of 
Snail  is  finely  controlled. 

In  summary,  these  observations  demonstrate  that  the 
repressive  activity  of  Snail  is  contributed  not  only  by  SNAG 
domain  but  also  by  the  carboxyl  zinc  fingers;  the  histone 
ubiquitination  also  plays  an  important  role  in  Snail-mediated 
transcriptional  repression.  Together,  this  sheds  new  light  on 
the  epigenetic  machinery  in  Snail-induced  EMT  and  cancer 
metastasis.  These  results  provide  unequivocal  evidence  to 
support  that  RinglA  and  RinglB  could  be  explored  for  new 
therapeutics  and  that  H2AK119Ubl  could  be  a  potential  bio¬ 
marker  of  pancreatic  cancer  prognosis. 
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